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BACKGROUND OF THE INVENTION 

Field nf th<» Invention 

The present invention relates generally to the field of 
molecular biology. More specifically, the present invention relates to 
molecular cloning and characterization of homologous 28-kDa 
protein genes in Ehrlichia canis and a multigene locus encoding the 
28-kDa homologous proteins of Ehrlichia canis and uses thereof. 



Description nf the Rel ated Art 

Canine ehrlichiosis, also known as canine tropical 
pancytopenia, is a tick-borne rickettsial disease of dogs first described 
in Africa in 1935 and the United States in 1963 (Donatien and 
20 Lestoquard. 1935; Ewing, 1963). The disease became better 
recognized after an epizootic outbreak occurred in United States 
military dogs during the Vietnam War (Walker et a/., 1970) 

The etiologic agent of canine ehrlichiosis is Ehrlichia canis, 
a small, gram-negative, obligate intracellular bacterium which exhibits 
25 tropism for mononuclear phagocytes (Nyindo et aL, 1971) and is 
transmitted by the brown dog tick, Rhipicephalus sanguineus (Groves 
et at., 1975). The progression of canine ehrlichiosis occurs in three 
phases, acute, subclinical and chronic. The acute phase is 
characterized by fever, anorexia, depression, lymphadenopathy and 
30 mild thrombocytopenia (Troy and Forrester. 1990). Dogs typically 
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recover from ihe acute phase, but become persistently infected 
carriers of the organism without clinical signs of disease for months 
or even years (Harrus et al, 1998). A chronic phase develops in 
some cases that is characterized by thrombocytopenia, 
hyperglobulinemia, anorexia, emaciation, and hemorrhage, 
particularly epistaxis, followed by death (Troy and Forrester, 1990). 

Molecular taxonomic analysis based on the 16S rRNA gene 
has determined that £ canis and £ chaffeensis, the etiologic agent of 
human monocytic ehrlichiosis (HME), are closely related (Anderson et 
al, 1991; Anderson et al, 1992; Dawson et al, 1991; Chen et al.. 
1994). Considerable cross reactivity of the 64, 47, 40, 30, 29 and 23- 
kDa antigens between E. canis and £ chaff eensis has been reported 
(Chen et al, 1994; Chen et al, 1997; Rikihisa et al, 1994; Rikihisa et 
al, 1992). Analysis of immunoreactive antigens with human and 
canine convalescent phase sera by immunoblot has resulted in the 
identification of numerous immunodominant proteins of E. canis, 
including a 30-kDa protein (Chen et al, 1997). In addition, a 30-kDa 
protein of E. canis has been described as a major immunodominant 
antigen recognized early in the immune response that is antigenically 
distinct from the 30-kDa protein of E chaff eensis (Rikihisa et al, 
1992; Rikihisa et al, 1994). Other immunodominant proteins of E 
canis with molecular masses ranging from 20 to 30-kDa have also 
been identified (Brouqui et al, 1992; Nyindo et al, 1991; Chen et al, 
1994; Chen et al, 1997). 

Recently, cloning and sequencing of a multigene family 
(omp-J) encoding proteins of 23 to 28-kDa have been described for £ 
chaffeensis (Ohashi et al. 1998). The 28-kDa immunodominant outer 
membrane protein gene (p28) of £ chaffeensis, homologous to the 
Cowdria ruminantium map- 1 gene, was cloned. Mice immunized with 
recombinant P28 were protected against challenge infection with the 
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homologous strain according to PCR analysis of periperal blood 5 days 
after challenge (Ohashi et ai., 1998). Molecular cloning of two 
similar, but nonidentical, tandemly arranged 28-kDa genes of £ canis 
homologous to £ chaffeensis omp-1 gene family and C. rumanintium 
5 map- J gene has also been reported (Reddy et al. y 1998). 

The prior art is deficient in the lack of cloning and 
characterization of new homologous 28-kDa immunorcactive protein 
genes of Ehrlichia canis and a single multigene locus containing the 
homologous 28-kDa protein genes. Further, The prior art is deficient 
0 in the lack of recombinant proteins of such immunorcactive genes of 
Ehrlichia canis. The present invention fulfills this long-standing need 
and desire in the art. 

SUMMARY OF THE INVENTION 

5 

The present invention describes the molecular cloning, 
sequencing, characterization, and expression of homologous mature 
28-kDa immunorcactive protein genes of Ehrlichia canis (designated 
Eca28-1, ECa28SA3 and ECa28SA2), and the identification of a single 
locus (5.592-kb) containing five 28-kDa protein genes of Ehrlichia 
canis (ECa28SAJ. ECa28SA2, ECa28SA3, Eca28-1 and ECa28-2). 
Comparison with £ chaffeensis and among £ canis 28-kDa protein 
genes revealed that ECa28-l shares the most amino acid homology 
with the £ chaffeensis omp-I multigene family and is highly conserved 
among £ canis isolates. The Five 28-kDa proteins were predicted to 
have signal peptides resulting in mature proteins, and had amino acid 
homology ranging from 51 to 72%. Analysis of intergenic regions 
revealed hypothetical promoter regions for each gene, suggesting that 
these genes may be independently and differentially expressed. 
Intergenic noncoding regions ranged in size from 299 to 355-bp. and 
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were 48 to 71% homologous. 

In one embodiment of the present invention, there are 
provided DNA sequences encoding a 30-kDa immunoreactive protein 
of Ehrlichia canis. Preferably, the protein has an amino acid sequence 
selected from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and 
SEQIDNo. 6, and the gene has a nucleic acid sequence selected from 
the group consisting of SEQ ID No. L SEQ ID No. 3 and SEQ ID No. 5 
and is a member of a polymorphic multiple gene family. Generally, 
the protein has an N-terminal signal sequence which is cleaved after 
post-translational process resulting in the production of a mature 28- 
kDa protein. Still preferably, the DNAs encoding 28-kDa proteins are 
contained in a single multigene locus, which has the size of 5.592 kb 
and encodes all five homologous 28-kDa proteins of Ehrlichia canis. 

In another embodiment of the present invention, there is 
provided an expression vector comprising a gene encoding a 28-kDa 
immunoreactive protein of Ehrlichia canis and capable of expressing 
the gene when the vector is introduced into a cell. 

In still another embodiment of the present invention, there 
is provided a recombinant protein comprising an amino acid sequence 
selected from the group consisting of SEQ ID No. 2, SEQIDNo. 4 and 
SEQ ID No. 6. Preferably, the amino acid sequence is encoded by a 
nucleic acid sequence selected from the group consisting of SEQ ID 
No. 1, SEQ ID No. 3 and SEQ ID No. 5. Preferably, the recombinant 
protein comprises four variable regions which are surface exposed, 
hydrophilic and antigenic. The recombinant protein may be useful as 
an antigen. 

In yet another embodiment of the present invention, there 
is provided a method of producing the recombinant protein, 
comprising the steps of obtaining a vector that comprises an 
expression region comprising a sequence encoding the amino acid 
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sequence selected from the group consisting of SEQ ID No. 2. SEQ ID 
No. 4 and SEQ ID No. 6 operatively linked to a promoter; trans fee ting 
the vector into a cell; and culturing the cell under conditions effective 
for expression of the expression region. 
5 The invention may also be described in certain 

embodiments as a method of inhibiting Ehrlichia canis infection in a 
subject comprising the steps of: identifying a subject suspected of 
being exposed to or infected with Ehrlichia canis; and administering a 
composition comprising a 28-kDa antigen of Ehrlichia canis in an 
10 amount effective to inhibit an Ehrlichia canis infection. The inhibition 
may occur through any means such as, i.e. the stimulation of the 
subject's humoral or cellular immune responses, or by other means 
such as inhibiting the normal function of the 28-kDa antigen, or even 
competing with the antigen for interaction with some agent in the 
15 subject's body. 

Other and further aspects, features, and advantages of the 
present invention will be apparent from the following description of 
the presently preferred embodiments of the invention given for the 
purpose of disclosure. 



20 



BRIEF DESCRIPTION OF THE DRAWINGS 



So that the matter in which the above-recited features, 
advantages and objects of the invention, as well as others which will 
25 become clear, are attained and can be understood in detail, more 
particular descriptions of the invention briefly summarized above may 
be had by reference to certain embodiments thereof which are 
illustrated in the appended drawings. These drawings form a part of 
the specification. It is to be noted, however, that the appended 
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drawings illustrate preferred embodiments of the invention and 
therefore are not to be considered limiting in their scope. 

Figure 1 shows nucleic acid sequence (SEQ ID No. 1) and 
deduced amino acid sequence (SEQ ID No. 2) of ECa28-I gene 
5 including adjacent 5\ and 3' non-coding sequences. The ATG start 
codon and TAA termination are shown in bold, and the 23 amino acid 
leader signal sequence is underlined. 

Figure 2 shows SDS-PAGE of expressed 50-kDa 
recombinant ECa28-l -thioredoxin fusion protein (Lane 1, arrow) and 
16-kDa thioredoxin control (Lane 2, arrow), and corresponding 
immunoblot of recombinant ECa28-l -thioredoxin fusion protein 
recognized by covalescent-phase £ canis canine antiserum (Lane 3). 
Thiroredoxin control was not detected by E.canis antiserum (not 
shown). 

Figure 3 shows alignment of ECa28-l protein (SEQ ID NO. 
2), and ECa28SA2 (partial sequence, SEQ ID NO. 7) and ECa28SAI (SEQ 
ID NO. 8), £ chaff eensis P28 (SEQ ID NO. 9), E chaff eensis OMP-1 
family (SEQ ID NOs: 10-14) and C. ruminantium MAP-1 (SEQ ID NO. 
15) amino acid sequences. The ECa28-l amino acid sequence is 
presented as the consensus sequence. Amino acids not shown are 
identical to ECa28-I and are represented by a dot. Divergent amino 
acids are shown with the corresponding one letter abbreviation. Gaps 
introduced for maximal alignment Of the amino acid sequences are 
denoted with a dash. Variable regions are underlined and denoted 
(VRL VR2, VR3, and VR4). The arrows indicate the predicted signal 
peptidase cleavage site for the signal peptide. 

Figure 4 shows phylogenetic relatedness of E. canis ECa28- 
1 with the ECa28SA2 (partial sequence) and ECa28SAl, 6 members of 
the E.chaffeensis omp-1 multiple gene family, and C. rumanintium 
map- 1 from deduced amino acid sequences utilizing unbalanced tree 
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construction. The length of each pair of branches represents the 
distance between the amino acid sequence of the pairs. The scaJe 
measures the distance between sequences. 

Figure 5 shows Southern blot analysis of £ cants genomic 
DNA completely digested with six individual restriction enzymes and 
hybridized with a ECa28-l DIG-labeied probe (Lanes 2-7); DIG-labeled 
molecular weight markers (Lanes 1 and 8). 

Figure 6 shows comparison of predicted protein 
characteristics of ECa28-l (Jake strain) and E chaffeensis P28 
(Arkansas strain). Surface probability predicts the surface residues 
by using a window of hexapeptide. A surface residue is any residue 
with a >2.0 nm 2 of water accessible surface area. A hexapeptide with 
a value higher than 1 was considered as surface region. The antigenic 
index predicts potential antigenic determinants. The regions with a 
15 value above zero are potential antigenic determinants. T-cell motif 
locates the potential T-cell antigenic determinants by using a motif of 
5 amino acids with residue 1-glycine or polar, residue 2-hydrophobic, 
residue 3 -hydrophobic, residue 4-hydrophobic or proline, and residue 
5-polar or glycine. The scale indicates amino acid positions. 
:0 Figure 7 shows nucleic acid sequences and deduced 

amino acid sequences of the £ canis 28-kDa protein genes ECa28SA2 
(nucleotide 1-849: SEQ ID No. 3; amino acid sequence: SEQ ID No. 4) 
and ECa28SA3 (nucleotide 1195-2031: SEQ ID No. 5; amino acid 
sequence: SEQ ID No. 6) including intergenic noncoding sequences 
5 (NC2, nucleotide 850-1194: SEQ ID No. 31). The ATG start codon and 
termination condons are shown in bold. 

Figure 8 shows schematic of the five £ canis 28-kDa 
protein gene locus (5.592-Kb) indicating genomic orientation and 
intergenic noncoding regions (28NC1-4). The 28-kDa protein genes 
3 shown in Locus I and 2 (shaded) have been described (McBride et al.. 
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1999; Reddy ei aL. 1998: Ohashi et ah. 1998). The complete sequence 
of ECaSA2 and a new 28-kDa protein gene designated {ECa28SA3 - 
unshaded) was sequenced. The noncoding intergenic regions (28NC2- 
3) between ECaSA2. ECa28SA3 and ECa28-I were completed joining the 
5 previously unlinked loci I and 2. 

Figure 9 shows phylogenetic relatedness of the five E 
canis 28-kDa protein gene members based on amino acid sequences 
utilizing unbalanced tree construction. The length of each pair of 
branches represents the distance between amino acid pairs. The scale 
10 measures the distance beteween sequences. 

Figure 10 shows alignment of E. canis 28-kDa protein 
gene intergenic noncoding nucleic acid sequences (SEQ ID Nos. 30- 
33). Nucleic acids not shown, denoted with a dot (..), are identical to 
noncoding region 1 (28NC1). Divergence is shown with the 
15 corresponding one letter abbreviation. Gaps introduced for maximal 
alignment of the amino acid sequences are denoted with a dash (-). 
Putative transcriptional promoter regions (-10 and -35) and ribosomal 
binding site (RBS) are boxed. 

20 DETAILED DESCRIPTION OF THE INVENTION 

The present invention describes cloning, sequencing and 
expression of homologous genes encoding a 30-kilodalton (kDa) 
protein of Ehrlichia canis. A comparative molecular analysis of 

25 homologous genes among seven E. canis isolates and the E chaffeensis 
omp-1 multigenc family was also performed. Two new 28-kDa protein 
genes are identified, ECa28-l and ECa28SA3. ECa28-l has an 834-bp 
open reading frame encoding a protein of 278 amino acids (SEQ ID 
No. 2) with a predicted molecular mass of 30.5-kDa. An N-terminal 

30 signal sequence was identified suggesting that the protein is post- 

8 



WO 00/32745 PCT/US99/28075 

transiationally modified to a mature protein of 27.7-kDa. ECd28SA3 
has an 840-bp open reading frame encoding a 280 amino acid protein 
(SEQ ID No. 6). 

Using PCR to amplify 28-kDa protein genes of E. cants, a 
previously unsequenced region of Eca28SA2 was completed. Sequence 
analysis of ECa28SA2 revealed an 849-bp open reading frame encoding 
a 283 amino acid protein (SEQ ID No. 4). PCR amplification using 
primers specific for 28-kDa protein gene intergenic noncoding regions 
linked two previously separate loci, identifying a single locus (5.592- 
kb) containing all five 28-kDa protein genes. The five 28-kDa 
proteins were predicted to have signal peptides resulting in mature 
proteins, and had amino acid homology ranging from 51 to 72%. 
Analysis of intergenic regions revealed hypothetical promoter regions 
for each gene, suggesting that these genes may be independently and 
differentially expressed. Intergenic noncoding regions (28NC1-4) 
ranged in size from 299 to 355-bp, and were 48 to 71% homologous. 

The present invention is directed to two new homologous 
28-kDa protein genes in Ehrlichia canis, Eca28-l and ECa28SA3, and a 
complete sequence of previously partially sequenced ECa28SA2. Also 
disclosed is a multigene locus encoding all five homologous 28-kDa 
outer membrane proteins of Ehrlichia canis. 

In one embodiment of the present invention, there are 
provided DNA sequences encoding a 30-kDa immunoreactive protein 
of Ehrlichia canis. Preferably, the protein has an amino acid sequence 
selected from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and 
SEQ ID No. 6. and the gene has a nucleic acid sequence selected from 
the group consisting of SEQ ID No. I. SEQ ID No. 3 and SEQ ID No. 5 
and is a member of a polymorphic multiple gene family. More 
preferably, the protein has an N-terminal signal sequence which is 
cleaved after post-translational process resulting in the production of 
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a mature 28-kDa protein. Still preferably, the DNAs encoding 28-kDa 
proteins are contained in a single multigene locus, which has the size 
of 5.592 kb and encodes all five homologous. 28-kDa proteins of 
Ehrlichia canis. 

5 In another embodiment of the present invention, there is 

provided an expression vector comprising a gene encoding a 28-kDa 
immunoreactive protein of Ehrlichia canis and capable of expressing 
the gene when the vector is introduced into a cell. 

In still another embodiment of the present invention, there 
0 is provided a recombinant protein comprising an amino acid sequence 
selected from the group consisting of SEQ ID No. 2. SEQ ID No. 4 and 
SEQ ID No. 6. Preferably, the amino acid sequence is encoded by a 
nucleic acid sequence selected from the group consisting of SEQ ID 
No. 1, SEQ ID No. 3 and SEQ ID No. 5. Preferably, the recombinant 
5 protein comprises four variable regions which are surface exposed, 
hydrophilic and antigenic. Still preferably, the recombinant protein is 
an antigen. 

In yet another embodiment of the present invention, there 
is provided a method of producing the recombinant protein, 
comprising the steps of obtaining a vector that comprises an 
expression region comprising a sequence encoding the amino acid 
sequence selected from the group consisting of SEQ ID No. 2, SEQ ID 
No. 4 and SEQ ID No. 6 operatively linked to a promoter; transfecting 
the vector into a cell; and culturing the cell under conditions effective 
for expression of the expression region. 

The invention may also be described in certain 
embodiments as a method of inhibiting Ehrlichia canis infection in a 
subject comprising the steps of: identifying a subject suspected of 
being exposed to or infected with Ehrlichia canis: and administering a 
composition comprising a 28-kDa antigen of Ehrlichia canis in an 

10 
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amount effective to inhibit an Ehrlichia cams infection. The inhibition 
may occur through any means such as, i.e. the stimulation of the 
subject's humoral or cellular immune responses, or by other means 
such as inhibiting the normal function of the 28-kDa antigen, or even 
5 competing with the antigen for interaction with some agent in the 
subject's body. 

In accordance with the present invention there may be 
employed conventional molecular biology, microbiology, and 
recombinant DNA techniques within the skill of the an. Such 

10 techniques are explained fully in the literature. See, e.g., Maniatis, 
Fritsch & Sambrook. "Molecular Cloning: A Laboratory Manual (1982); 
"DNA Cloning: A Practical Approach." Volumes I and II (D.N. Glover ecL 
1985); "Oligonucleotide Synthesis" (M.J. Gait ed. 1984); "Nucleic Acid 
Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; "Transcription 

15 and Translation" [B.D. Hames & S.J. Higgins eds. (1984)]; "Animal Cell 
Culture" [R.I. Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" 
[IRL Press, (1986)]; B. Perbal, "A Practical Guide To Molecular Cloning" 
(1984). 

Therefore, if appearing herein, the following terms shall 
20 have the definitions set out below. 

A "replicon" is any genetic element (e.g., plasmid, 
chromosome, virus) that functions as an autonomous unit of DNA 
replication in vivo; i.e., capable of replication under its own control. 

A "vector" is a replicon. such as plasmid, phage or cosmid. 
25 to which another DNA segment may be attached so as to bring about 
the replication of the attached segment. 

A "DNA molecule" refers to the polymeric form of 
deoxyribonucleotides (adenine, guanine, thymine, or cytosine) in its 
either single stranded form, or a double-stranded helix. This term 
30 refers only to the primary and secondary structure of the molecule. 

II 
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and does not limit it to any particular tertiary forms. Thus, this term 
includes double-stranded DNA found, inter alia, in linear DNA 
molecules (e.g., restriction fragments), viruses, plasmids. and 
chromosomes. In discussing the structure herein according to the 
normal convention of giving only the sequence in the 5' to 3' direction 
along the nontranscribed strand of DNA (i.e., the strand having a 
sequence homologous to the mRNA). 

A DNA "coding sequence" is a double-stranded DNA 
sequence which is transcribed and translated into a polypeptide in 
vivo when placed under the control of appropriate regulatory 
sequences. The boundaries of the coding sequence are determined by 
a start codon at the 5" (amino) terminus and a translation stop codon 
at the 3' (carboxyl) terminus. A coding sequence can include, but is 
not limited to, prokaryotic sequences, cDNA from eukaryotic mRNA, 
genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and 
even synthetic DNA sequences. A polyadenylation signal and 
transcription termination sequence will usually be located 3' to the 
coding sequence. 

Transcriptional and translational control sequences arc 
DNA regulatory sequences. such as promoters. enhancers, 
polyadenylation signals, terminators, and the like, that provide for the 
expression of a coding sequence in a host cell. 

A "promoter sequence" is a DNA regulatory region capable 
of binding RNA polymerase in a cell and initiating transcription of a 
downstream (3 1 direction) coding sequence. For purposes of defining 
the present invention, the promoter sequence is bounded at its 3 ' 
terminus by the transcription initiation site and extends upstream (5* 
direction) to include the minimum number of bases or elements 
necessary to initiate transcription at levels detectable above 
background. Within the promoter sequence will be found a 
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transcription initiation site, as well as protein binding domains 
(consensus sequences) responsible for the binding of RNA 
polymerase. Eukaryotic promoters often, but not always, contain 
"TATA" boxes and "CAT" boxes. Prokaryotic promoters contain Shine- 
Dalgarno sequences in addition to the -10 and -35 consensus 
sequences. 

An "expression control sequence" is a DNA sequence that 
controls and regulates the transcription and translation of another 
DNA sequence. A coding sequence is "under the control" of 
transcriptional and translational control sequences in a cell when RNA 
polymerase transcribes the coding sequence into mRNA, which is then 
translated into the protein encoded by the coding sequence. 

A "signal sequence" can be included near the coding 
sequence. This sequence encodes a signal peptide, N-terminal to the 
polypeptide, that communicates to the host cell to direct the 
polypeptide to the cell surface or secrete the polypeptide into the 
media, and this signal peptide is clipped off by the host cell before the 
protein leaves the cell. Signal sequences can be found associated with 
a variety of proteins native to prokaryotes and eukaryotes. 

The term "oligonucleotide", as used herein in referring to the 
probe of the present invention, is defined as a molecule comprised of 
two or more ribonucleotides, preferably more than three. Its exact 
size will depend upon many factors which, in turn, depend upon the 
ultimate function and use of the oligonucleotide. 

The term "primer" as used herein refers to an 
oligonucleotide, whether occurring naturally as in a purified 
restriction digest or produced synthetically, which is capable of acting 
as a point of initiation of synthesis when placed under conditions in 
which synthesis of a primer extension product, which is 
complementary to a nucleic acid strand, is induced, i.e.. in the 
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presence of nucleotides and an inducing agent such as a DNA 
polymerase and at a suitable temperature and pH. The primer may be 
either single-stranded or double-stranded and must be sufficiently 
long to prime the synthesis of the desired extension product in the 
presence of the inducing agent. The exact length of the primer will 
depend upon many factors, including temperature, source of primer 
and use the method. For example, for diagnostic applications, 
depending on the complexity of the target sequence, the 
oligonucleotide primer typically contains 15-25 or more nucleotides, 
although it may contain fewer nucleotides. 

The primers herein are selected to be "substantially" 
complementary to different strands of a particular target DNA 
sequence. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands. Therefore, 
the primer sequence need not reflect the exact sequence of the 
template. For example, a non-complementary nucleotide fragment 
may be attached to the 5' end of the primer, with the remainder of the 
primer sequence being complementary to the strand. Alternatively, 
non-complementary bases or longer sequences can be interspersed 
into the primer, provided that the primer sequence has sufficient 
complementarity with the sequence or hybridize therewith and 
thereby form the template for the synthesis of the extension product. 

A cell has been "transformed" by exogenous or 
heterologous DNA when such DNA has been introduced inside the cell. 
The transforming DNA may or may not be integrated (covalently 
linked) into the genome of the ceil. In prokaryotcs, yeast, and 
mammalian cells for example, the transforming DNA may be 
maintained on an episomal element such as a piasmid. With respect 
to eukaryotic cells, a stably transformed cell is one in which the 
transforming DNA has become integrated into a chromosome so that 

14 
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it is inherited by daughter cells through chromosome replication. 
This stability is demonstrated by the ability of the eukaryotic cell to 
establish cell lines or clones comprised of a population of daughter 
cells containing the transforming DNA. A "clone" is a population of 
cells derived from a single cell or ancestor by mitosis. A "cell line" is 
a clone of a primary cell that is capable of stable growth in vitro for 
many generations. 

Two DNA sequences are "substantially homologous" when 
at least about 75% (preferably at least about 80%, and most 
preferably at least about 90% or 95%) of the nucleotides match over 
the defined length of the DNA sequences. Sequences that are 
substantially homologous can be identified by comparing the 
sequences using standard software available in sequence data banks, 
or in a Southern hybridization experiment under, for example, 
stringent conditions as defined for that particular system. Defining 
appropriate hybridization conditions is within the skill of the art. See, 
e.g., Maniatis et ah, supra; DNA Cloning, Vols. I & H, supra; Nucleic 
Acid Hybridization, supra. 

A "heterologous' region of the DNA construct is an 
identifiable segment of DNA within a larger DNA molecule that is not 
found in association with the larger molecule in nature. Thus, when 
the heterologous region encodes a mammalian gene, the gene will 
usually be flanked by DNA that does not flank the mammalian 
genomic DNA in the genome of the source organism. In another 
example, coding sequence is a construct where the coding sequence 
itself is not found in nature (e.g., a cDNA where the genomic coding 
sequence contains introns, or synthetic sequences having codons 
different than the native gene). Allelic variations or naturally- 
occurring mutational events do not give rise to a heterologous region 
of DNA as defined herein. 

15 
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The labels most commonly employed for these studies are 
radioactive elements, enzymes, chemicals which fluoresce when 
exposed to untraviolet light, and others. A number of fluorescent 
materials are known and can be utilized as labels. These include, for 
5 example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue 
and Lucifer Yellow. A particular detecting material is anti-rabbit 
antibody prepared in goats and conjugated with fluorescein through 
an isothiocyanate. 

Proteins can also be labeled with a radioactive element or 
10 with an enzyme. The radioactive label can be detected by any of the 
currently available counting procedures. The preferred isotope may 
be selected from ^H, \*C, 32p t 355, 36q, 51Q-, ^C 0i 58c 0 , 59p e , 90y, 125^ 
1.311, and i86Re. 

Enzyme labels are likewise useful, and can be detected by 
15 any of the presently utilized colorimetric, spectrophotometric, 
fluorospectrophotometric, amperometric or gasometric techniques. 
The enzyme is conjugated to the selected particle by reaction with 
bridging molecules such as carbodiimides, diisocyanates, 
glutaraldehyde and the like. Many enzymes which can be used in 
20 these procedures are known and can be utilized. The preferred are 
peroxidase, {J-glucuronidase, P-D-glucosidase, {J-D-galactosidase, 
urease, glucose oxidase plus peroxidase and alkaline phosphatase. 
U.S. Patent Nos. 3,654,090, 3,850,752, and 4,016,043 are referred to 
by way of example for their disclosure of alternate labeling material 
25 and methods. 

As used herein, the term "host" is meant to include not 
only prokaryotes but also eukaryotes such as yeast, plant and animal 
cells. A recombinant DN A molecule or gene which encodes a 28-kDa 
immunoreactive protein of Ehrlichia canis of the present invention can 
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be used to transform a host using any of the techniques commonly 
known to those of ordinary skill in the art. Especially preferred is the 
use of a vector containing coding sequences for a gene encoding a 28- 
kDa immunoreactive protein of Ehrlichia canis of the present 
invention for purposes of prokaryote transformation. 

Prokaryotic hosts may include E coli y S. tymphimurium, 
Serratia marcescens and Bacillus subtilis. Eukaryotic hosts include 
yeasts such as Pickia pastoris, mammalian cells and insect cells. 

In general, expression vectors containing promoter 
sequences which facilitate the efficient transcription of the inserted 
DNA fragment are used in connection with the host. The expression 
vector typically contains an origin of replication, promoter(s), 
terminator(s), as well as specific genes which are capable of providing 
phenotypic selection in transformed cells. The transformed hosts can 
be fermented and cultured according to means known in the art to 
achieve optimal cell growth. 

The invention includes a substantially pure DNA encoding 
a 28-kDa immunoreactive protein of Ehrlichia canis , a strand of which 
DNA will hybridize at high stringency to a probe containing a 
sequence of at least 15 consecutive nucleotides of SEQ ID No. 1 or SEQ 
ID No. 3 or SEQ ID No. 5. The protein encoded by the DNA of this 
invention may share at least 80% sequence identity (preferably 85 %, 
more preferably 90%, and most preferably 95%) with the amino acids 
listed in SEQ ID No. 2 or SEQ ID No. 4 or SEQ ID No. 6. More 
preferably, the DNA includes the coding sequence of the nucleotides 
of SEQ ED No. 1 or SEQ ID No. 3 or SEQ ID No. 5, or a degenerate 
variant of such a sequence. 

The probe to which the DNA of the invention hybridizes 
preferably consists of a sequence of at least 20 consecutive 
nucleotides, more preferably 40 nucleotides, even more preferably 5 0 
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nucleotides, and most preferably 100 nucleotides or more (up to 
100%) of the coding sequence of the nucleotides listed in SEQ ID No. 1 
or SEQ ID No. 3 or SEQ ID No. 5 or the complement thereof. Such a 
probe is useful for detecting expression of the 28-kDa 
5 immunoreactive protein of Ehrlichia canis in a human cell by a 
method including the steps of (a) contacting mRN A obtained from the 
cell with the labeled hybridization probe; and (b) detecting 
hybridization of the probe with the mRNA. 

This invention also includes a substantially pure DNA 

10 containing a sequence of at least 15 consecutive nucleotides 
(preferably 20, more preferably 30, even more preferably 50, and 
most preferably all) of the region from the nucleotides listed in SEQ ID 
No 1 or SEQ ID No. 3 or SEQ ID No. 5. 

By "high stringency" is meant DNA hybridization and wash 

15 conditions characterized by high temperature and low salt 
concentration, e.g., wash conditions of 65°C at a salt concentration of 
approximately 0.1 x SSC, or the functional equivalent thereof. For 
example, high stringency conditions may include hybridization at 
about 42°C in the presence of about 50% formamide; a first wash at 

20 about 65°C wilh about 2 x SSC containing 1% SDS; followed by a 
second wash at about 65°C with about 0.1 x SSC. 

By "substantially pure DNA" is meant DNA that is not part 
of a milieu in which the DNA naturally occurs, by virtue of separation 
(partial or total purification) of some or all of the molecules ^pt that 

25 milieu, or by virtue of alteration of sequences that flank the claimed 
DNA. The term therefore includes, for example, a recombinant DNA 
which is incorporated into a vector, into an autonomously replicating 
plasmid or virus, or into the genomic DNA of a prokaryote or 
eukaryote; or which exists as a separate molecule (e.g., a cDNA or a 

30 genomic or cDNA fragment produced by polymerase chain reaction 
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(PCR) or restriction endonuclease digestion) independent of other 
sequences. It also includes a recombinant DNA which is part of a 
hybrid gene encoding additional polypeptide sequence, e.g., a fusion 
protein. Also included is a recombinant DNA which includes a portion 
5 Of the nucleotides listed in SEQ ID No. 1 or SEQ ID No. 3 or SEQ ID No. 
5 which encodes an alternative splice variant of a gene encoding a 28- 
kDa immunoreactive protein of Ehrlichia canis. 

The DNA may have at least about 70% sequence identity to 
the coding sequence of the nucleotides listed in SEQ ID No. 1 or SEQ ID 

10 No. 3 or SEQ ID No. 5, preferably at least 75% (e.g. at least 80%); and 
most preferably at least 90%. The identity between two sequences is a 
direct function of the number of matching or identical positions. 
When a subunit position in both of the two sequences is occupied by 
the same monomeric subunit, e.g., if a given position is occupied by 

15 an adenine in each of two DNA molecules, then they are identical at 
that position. For example, if 7 positions in a sequence 
10 nucleotides in length are identical to the corresponding positions 
in a second 10-nucleotide sequence, then the two sequences have 70% 
sequence identity. The length of comparison sequences will generally 

20 be at least 50 nucleotides, preferably at least 60 nucleotides, more 
preferably at least 75 nucleotides, and most preferably 
100 nucleotides. Sequence identity is typically measured using 
sequence analysis software (e.g., Sequence Analysis Software Package 
of the Genetics Computer Group, University of Wisconsin 

25 Biotechnology Center, 1710 University Avenue, Madison, Wr 53705). 

The present invention comprises a vector comprising a 
DNA sequence coding for a which encodes a gene encoding a 28-kDa 
immunoreactive protein of Ehrlichia canis and said vector is capable 
of replication in a host which comprises, in operable linkage: a) an 

30 origin of replication; b) a promoter; and c) a DNA sequence coding 
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for said protein. Preferably, the vector of the present invention 
contains a portion of the DNA sequence shown in SEQ ID No. 1 or SEQ 
ID No. 3 or SEQ ID No. 5. 

A "vector" may be defined as a replicable nucleic acid 
5 construct, e.g., a plasmid or viral nucleic acid. Vectors may be used 
to amplify and/or express nucleic acid encoding a 28-kDa 
immunoreactive protein of Ehrlichia canis. An expression vector is a 
replicable construct in which a nucleic acid sequence encoding a 
polypeptide is operably linked to suitable control sequences capable 

10 of effecting expression of the polypeptide in a cell. The need for such 
control sequences will vary depending upon the cell selected and the 
transformation method chosen. Generally, control sequences include 
a transcriptional promoter and/or enhancer, suitable mRNA ribosomal 
binding sites, and sequences which control the termination of 

15 transcription and translation. Methods which are well known to those 
skilled in the art can be used to construct expression vectors 
containing appropriate transcriptional and translational control 
signals. See for example, the techniques described in Sambrook et ah, 
1989, Molecular Cloning: A Laboratory Manual (2nd Ed.), Cold Spring 

20 Harbor Press, N.Y. A gene and its transcription control sequences are 
defined as being "operably linked" if the transcription control 
sequences effectively control the transcription of the gene. Vectors of 
the invention include, but are not limited to, plasmid vectors and viral 
vectors. Preferred viral vectors of the invention are those derived 

25 from retroviruses, adenovirus, adeno-associated virus, SV40 virus, or 
herpes viruses. 

By a "substantially pure protein" is meant a protein which 
has been separated from at least some of those components which 
naturally accompany it. Typically, the protein is substantially pure 

30 when it is at least 60%, by weight, free from the proteins and other 
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naturally-occurring organic molecules with which it is naturally 
associated in vivo. Preferably, the purity of the preparation is at least 
75%, more preferably at least 90%, and most preferably at least 99%, 
by weight. A substantially pure 28-kDa immunoreactive protein of 
Ehrlichia can is may be obtained, for example, by extraction from a 
natural source; by expression of a recombinant nucleic acid encoding 
a 28-kDa immunoreactive protein of Ehrlichia canis; or by chemically 
synthesizing the protein. Purity can be measured by any appropriate 
method, e.g., column chromatography such as immunoaffinity 
chromatography using an antibody specific for a 28-kDa 
immunoreactive protein Of Ehrlichia canis, polyacrylamide gel 
electrophoresis, or HPLC analysis. A protein is substantially free of 
naturally associated components when it is separated from at least 
some of those contaminants which accompany it in its natural state. 
Thus, a protein which is chemically synthesized or produced in a 
cellular system different from the cell from which it naturally 
originates will be, by definition, substantially free from its naturally 
associated components. Accordingly, substantially pure proteins 
include eukaryotic proteins synthesized in E. coli, other prokaryotes, 
or any other organism in which they do not naturally occur. 

In addition to substantially full-length proteins, the 
invention also includes fragments (e.g., antigenic fragments) of the 
28-kDa immunoreactive protein of Ehrlichia canis (SEQ ID No. 2 or 
SEQ ID No. 4 or SEQ ID No. 6). As used herein, -fragment," as applied 
to a polypeptide, will ordinarily be at least 10 residues, more typically 
at least 20 residues, and preferably at least 30 (e.g., 50) residues in 
length, but less than the entire, intact sequence. Fragments of the 28- 
kDa immunoreactive protein of Ehrlichia canis can be generated by 
methods known to those skilled in the art, e.g., by enzymatic digestion 
of naturally occurring or recombinant 28-kDa immunoreactive 
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protein of Ehrlichia canis + by recombinant DNA techniques using an 
expression vector that encodes a defined fragment of 28-kDa 
immunoreactive protein of Ehrlichia canis, or by chemical synthesis. 
The ability of a candidate fragment to exhibit a characteristic of 28- 
5 kDa immunoreactive protein of Ehrlichia canis (e.g., binding to an 
antibody specific for 28-kDa immunoreactive protein of Ehrlichia 
canis) can be assessed by methods described herein. Purified 28-kDa 
immunoreactive protein of Ehrlichia canis or antigenic fragments of 
28-kDa immunoreactive protein of Ehrlichia canis can be used to 

10 generate new antibodies or to test existing antibodies (e.g., as positive 
controls in a diagnostic assay) by employing standard protocols 
known to those skilled in the art. Included in this invention are 
polyclonal antisera generated by using 28-kDa immunoreactive 
protein of Ehrlichia canis or a fragment of 28-kDa immunoreactive 

15 protein of Ehrlichia canis as the immunogen in, e.g., rabbits. Standard 
protocols for monoclonal and polyclonal antibody production known 
to those skilled in this art are employed. The monoclonal antibodies 
generated by this procedure can be screened for the ability to identify 
recombinant Ehrlichia canis cDNA clones, and to distinguish them 

20 from known cDNA clones. 

Further included in this invention are fragments of the 28- 
kDa immunoreactive protein of Ehrlichia canis which are encoded at 
least in part by portions of SEQ ID No. 1 or SEQ ID No. 3 or SEQ ED No. 
5, e.g., products of alternative mRNA splicing or alternative protein 

25 processing events, or in which a section of the sequence has been 
deleted. The fragment, or the intact 28-kDa immunoreactive protein 
of Ehrlichia canis, may be covalently linked to another polypeptide,, 
e.g. which acts as a label, a ligand or a means to increase antigenicity. 

The phrase "pharmaceutically acceptable" refers to 

30 molecular entities and compositions that do not produce an allergic 
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or similar untoward reaction when administered to a human. The 
preparation of an aqueous composition that contains a protein as an 
active ingredient is well understood in the art. Typically, such 
compositions are prepared as injectables, either as liquid solutions or 
5 suspensions; solid forms suitable for solution in, or suspension in, 
liquid prior to injection can also be prepared. The preparation can 
also be emulsified. 

A protein may be formulated into a composition in a 
neutral or salt form. Pharmaceutically acceptable salts, include the 

10 acid addition salts (formed with the free amino groups of the protein) 
and which are formed with inorganic acids such as, for example, 
hydrochloric or phosphoric acids, or such organic acids as acetic, 
oxalic, tartaric, mandelic, and the like. Salts formed with the free 
carboxyl groups can also be derived from inorganic bases such as, for 

15 example, sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and such organic bases as isopropylamine, 
trimethylamine, histidine, procaine and the like. 

Upon formulation, solutions will be administered in a 
manner compatible with the dosage formulation and in such amount 

20 as is therapeutically effective. The formulations are easily 
administered in a variety of dosage forms such as injectable solutions. 

For parenteral administration in an aqueous solution, for 
example, the solution should be suitably buffered if necessary and the 
liquid diluent first rendered isotonic with sufficient saline or glucose. 

25 These particular aqueous solutions are especially suitable for 
intravenous, intramuscular, subcutaneous and intraperitoneal 
administration. In this connection, sterile aqueous media which can 
be employed will be known to those of skill in the art in light of the 
present disclosure. For example, one dosage could be dissolved in 1 

3X) mL of isotonic NaCl solution and either added to lOOOmL of 
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hypodermoclysis fluid or injected at the proposed site of infusion, 
(see for example, "Remington's Pharmaceutical Sciences" 15th Edition, 
pages 1035-1038 and 1570-1580). Some variation in dosage will 
necessarily occur depending on the condition of the subject being 
5 treated. The person responsible for administration will, in any event, 
determine the appropriate dose for the individual subject. 

As is well known in the art, a given polypeptide may vary 
in its immunogenicity. It is often necessary therefore to couple the 
immunbgen (e.g., a polypeptide of the present invention) with a 
10 carrier. Exemplary and preferred carriers are keyhole limpet 
hemocyanin (KLH) and human serum albumin. Other carriers may 
include a variety of lymphokines and adjuvants such as IL2, IL4, IL8 
and others. 

Means for conjugating a polypeptide to a carrier protein 

15 are well known in the art and include glutaraldehyde, m- 
maleimidobenzoyl-N-hydroxysuccinimide ester, carbo-diimide and 
bis-biazotized benzidine. It is also understood that the peptide may 
be conjugated to a protein by genetic engineering techniques that are 
well known in the art. 

20 As is also well known in the art, immunogenicity to a 

particular immunogen can be enhanced by the use of non-specific 
stimulators of the immune response known as adjuvants. Exemplary 
and preferred adjuvants include complete BCG, Detox, (RIBI, 
Immunochem Research Inc.) ISCOMS and aluminum hydroxide 

25 adjuvant (Superphos, Biosector). 

As used herein the term "complement" is used to define 
the strand of nucleic acid which will hybridize to the first nucleic acid 
sequence to form a double stranded molecule under stringent 
conditions. Stringent conditions are those that allow hybridization 

30 between two nucleic acid sequences with a high degree of homology, 
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but precludes hybridization of random sequences. For example, 
hybridization at low temperature and/or high ionic strength is termed 
low stringency and hybridization at high temperature and/or low ionic 
strength is termed high stringency. The temperature and ionic 
5 strength of a desired stringency are understood to be applicable to 
particular probe lengths, to the length and base content of the 
sequences and to the presence of formamide in the hybridization 
mixture. 

As used herein, the term "engineered" or "recombinant" 
10 cell is intended to refer to a cell into which a recombinant gene, such 
as a gene encoding an Ehrlichia chaff eensis antigen has been 
introduced. Therefore, engineered cells are distinguishable from 
naturally occurring cells which do not contain a recombinantly 
introduced gene. Engineered cells are thus cells having a gene or 
15 genes introduced through the hand of man. Recombinantly 
introduced genes will either be in the form of a cDNA gene, a copy of 
a genomic gene, or will include genes positioned adjacent to a 
promoter not naturally associated with the particular introduced 
gene. In addition, the recombinant gene may be integrated into the 
20 host genome, or it may be contained in a vector, or in a bacterial 
genome transfected into the host cell. 

The following examples are given for the purpose of 
illustrating various embodiments of the invention and are not meant 
to limit the present invention in any fashion. 



25 



EXAMPLE 1 



Ehrlichias and Purification 

Ehrlichia canis (Florida strain and isolates Demon, DJ, 
30 Jake, and Fuzzy) were provided by Dr. Edward Breitschwerdt, (College 
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of Veterinary Medicine, North Carolina State University, Raleigh, NC). 
E canis (Louisiana strain) was provided by Dr. Richard E. Corstvet 
(School of Veterinary Medicine, Louisiana State University, Baton 
Rouge, LA) and £ canis (Oklahoma strain) was provided by Dr. 
5 Jacqueline Dawson (Centers for Disease Control and Prevention, 
Atlanta, GA). Propagation of ehrlichiae was performed in DH82 cells 
with DMEM supplemented with 10% bovine calf serum and 2 mM L- 
glutamine at 37*C. The intracellular growth in DH82 cells was 
monitored by presence of £ canis morulae using general cytologic 
0 staining methods. Cells were harvested when 100% of the cells were 
infected with ehrlichiae and were then pelleted in a centrifuge at 
17,000 x g for 20 min. Cell pellets were disrupted with a Braun-Sonic 
2000 sonicator twice at 40W for 30 sec on ice. Ehrlichiae were 
purified as described previously (Weiss et at, 1975). The lysate was 
loaded onto discontinuous gradients of 42%-36%-30% renografin, and 
centrifuged at 80,000 x g for 1 hr. Heavy and light bands containing 
ehrlichiae were collected and washed with sucrose-phosphate- 
glutamate buffer (SPG, 218 mM sucrose, 3.8 mM KH 2 P0 4 , 7.2 mM 
K 2 HP0 4 , 4.9 mM glutamate, pH 7.0) and pelleted by centrifugation. 

EXAMELE-2 

NncltMO Acid Preparation 

Ehrlichia canis genomic DNA was prepared by 
resuspending the renografin-purified ehrlichiae in 600 ul of 10 mM 
Tris-HCl buffer (pH 7.5) with 1% sodium dodecyl sulfate (SDS, w/v) 
and 100 ng/ml of proteinase K as described previously (McBride et al, 
1996). This mixture was incubated for 1 hr at 56° C, and the nucleic 
acids were extracted twice with a mixture of 
phenol/chloroform/isoamyl alcohol (24:24:1). DNA was pelleted by 
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absolute ethanol precipitation, washed once with 70% ethanol, dried 
and resuspended in lOmM Tris (pH 7.5). Plasmid DNA was purified by 
using High Pure Plasmid Isolation Kit (Boehringer Mannheim, 
Indianapolis, IN), and PCR products were purified using a QIAquick PCR 
5 Purification Kit (Qiagen, Santa Clarita, CA). 

EXAMPLE 3 

PCR Amplification of the R. canis ?8-kDa protein genfiS 
10 Regions of the £ canis ECa28-l gene selected for PCR 

amplification were chosen based on homology observed (>90%) in the 
consensus sequence generated from Jotun-Hein aligorithm alignment 
of E. chaffeensis p28 and Cowdria ruminantium map-1 genes. Forward 
primer 793 (5-GCAGGAGCTGTTGGTTACTC-3') (SEQ ID NO. 16) and 

15 reverse primer 1330 (S'-CCTTCCTCCAAGTTCTATGCC-S') (SEQ ID NO. 
17) corresponded to nucleotides 313-332 and 823-843 of C. 
ruminantium MAP-1 and 307-326 and 834-814 of E. chaffeensis P28. 
E canis (a North Carolina isolate, lake) DNA was amplified with 
primers 793 and 1330 with a thermal cycling profile of 95°C for 2 

20 min, and 30 cycles of 95°C for 30 sec, 62° C for 1 min, 72°C for 2 min 
followed by a 72°C extension for 10 min and 4°C hold. PCR products 
were analyzed on 1% agarose gels. This amplified PCR product was 
sequenced directly with primers 793 and 1330. 

Primers specific for ECa28SA2 gene designated 46f (5'- 

25 ATATACTTCCTACCTAATGTCTCA-3 , > SEQ ID No. 18) and primer 1330 
(SEQ ID No. 17) were used to amplify the targeted region. The 
amplified product was gel purified and cloned into a TA cloning vector 
(Invitrogen, Santa Clarita, CA). The clone was sequenced 
bidirectionally with primers: Ml 3 reverse from the vector, 46f, 

30 ECa28SA2 ^-AGTGCAGAGTCTTCGGTTTC-3', SEQ ID No. 19), ECa5.3 
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(5'-GTTACTTGCGGAGGACAT-3'. SEQ ID No. 20). DNA was amplified 
with a thermal cycling profile of 95°C for 2 min, and 30 cycles of 95°C 
for 30 sec, 48°C for 1 min, 72°C for 1 min followed by a 72°C 
extension for 10 min and 4°C hold. 

5 

EXAMPLE 4 

Sequencing Unknown 51 and T Regions of the ECa28-I Gene 

The full length sequence of ECa28-l was determined using 

10 a Universal GenomeWalker Kit (CLONTECH, Palo Alto, CA) according to 
the protocol supplied by the manufacturer. Genomic £ canis (Jake 
isolate) DNA was digested completely with five restriction enzymes 
(Drah EcoRV, Pvull, Seal, Stul) which produce blunt-ended DNA. An 
adapter (API) supplied in the kit was ligated to each end of E canis 

15 DNA. The genomic libraries were used as templates to find the 
unknown DNA sequence of the ECa28-l gene by PCR using a primer 
complementary to a known portion of the ECa28-l sequence and a 
primer specific for the adapter API. Primers specific for ECa28-l used 
for genome walking were designed from the known DNA sequence 

20 derived from PCR amplification of ECa28-l with primers 793 (SEQ ID 
NO. 16) and 1330 (SEQ ID NO. 17). Primers 394 (5'- 
GCATTTCCACAGGATCATAGGTAA-y; nucleotides 687-710, SEQ ID NO. 
21) and 394C (S'-TTACCTATGATCCTGT GGAAATGC-3; nucleotides 
710-687, SEQ ID NO. 22) were used in conjunction with supplied 

25 primer API to amplify the unknown 5' and 3* regions of the ECa28-l 
gene by PCR. A PCR product corresponding to the 5' region of the 
ECa28-I gene amplified with primers 394C and API (2000-bp) was 
sequenced unidirectionally with primer 793C (5*-GAGTA 
ACCAACAGCTCCTGC-3\ SEQ ID No. 23). A PCR product corresponding 

28 
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to the 3' region of the ECa28-l gene amplified with primers 394 and 
API (580-bp) was sequenced bidirectionally with the same primers. 
Noncoding regions on the 5' and 3' regions adjacent to the open 
reading frame were sequenced, and primers EC280M-F (5*- 
5 TCTACTTTGCACTTCC ACTATTGT-3\ SEQ ID NO. 24) and EC280M-R (5 ' - ■ 
ATTCTTTTGCCACTATTT TTCTTT-3\ SEQ ID NO. 25) complementary to 
these regions were designed in order to amplify the entire ECa28-l 
gene. 

10 EXAMPLES 

SftqjifinHng of K ranis isolates 

DNA was sequenced with an ABI Prism 377 DNA Sequencer 

(Perkin- Elmer Applied Biosystems, Foster City, CA). The entire Eca28- 
15 / genes of seven E canis isolates (four from North Carolina, and one 

each from Oklahoma, Florida, and Louisiana) were amplified by FCR 

with primers EC280M-F (SEQ ID No. 24) and EC280M-R (SEQ ID No. 

25) with a thermal cycling profile of 95°C for 5 minutes, and 30 cycles 

of 95°C for 30 seconds, 62°C for 1 minutes, and 72°C for 2 minutes 
20 and a 72°C extension for 10 minutes. The resulting PCR products were 

bidirectionally sequenced with the same primers. 
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EXAMPLE 6 



Pinning and Expression of F rnnh ECa28zl 

The entire £ canis ECa28-l gene was PCR-amplified with 
primers-EC280M-F and EC280M-R and cloned into pCR2.I-TOPO TA 
cloning vector to obtain the desired set of restriction enzyme cleavage 
30 sites (Invitrogen, Carlsbad, CA). The insert was excised from pCR2.1- 

29 
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TOPO with BstX I and ligated into pcDNA 3.1 eukaryotic expression 
vector (Invitrogen, Carlsbad, CA) designated pcDNA3.1/EC28 for 
subsequent studies. The pcDNA3.1/EC28 plasmid was amplified, and 
the gene was excised with a Kpnl-Xbal double digestion and 
5 directionally ligated into pThioHis prokaryotic expression vector 
(Invitrogen, Carlsbad, CA). The clone (designated pThioHis/EC28) 
produced a recombinant thioredoxin fusion protein in Escherichia coli 
BL21. The recombinant fusion protein was crudely purified in the 
insoluble phase by centrifugation. The control thioredoxin fusion 
10 protein was purified from soluble cell lysates under native conditions 
using nickel-NTA spin columns (Qiagen, Santa Clarita, CA). 

EXAMELEJZ 

15 

WpRtprn Immnnohlot Analysis 

Recombinant E. canis ECa28-l fusion protein was subjected 
to SDS-polyacrylamide gel electrophoresis (SDS-PAGE) on 4-15% Tris- 
HC1 gradient gels (Bio-Rad, Hercules, CA) and transferred to pure 

20 nitrocellulose (Schleicher & Schuell, Keene, NH) using a semi-dry 
transfer cell (Bio-Rad, Hercules, CA). The membrane was incubated 
with convalescent phase antisera from an £. canis-infected dog diluted 
1:5000 for 1 hour, washed, and then incubated with an anti-canine IgG 
(H & L) alkaline phosphatase-conjugated affinity-purified secondary 

25 antibody at 1:1000 for 1 hour (Kirkegaard & Perry Laboratories, 
Gaithersburg, MD). Bound antibody was visualized with 5-bromo-4- 
chloro-3-indolyI phosphate/nitroblue tetrazolium (BCIP/NBT) 
substrate (Kirkegaard & Perry Laboratories, Gaithersburg, MD). 
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EXAMELO 

Southern Blot Analysis 

To determine if multiple genes homologous to the ECa28-l 
5 gene were present in the £ cants genome, a genomic Southern blot 
analysis was performed using a standard procedure (Sambrook et al. 
1989). £ cams genomic DNA digested completely with each of the 
restriction enzymes Banll, EcoRV, Haell, Kpnl and Spel, which do not 
cut within the ECa28-I gene, and Asel which digests ECa28-I at 

10 nucleotides 34, 43 and 656. The probe was produced by FCR 
amplification with primers EC280M-F and EC280M-R and digoxigenin 
(DIG)-labeled deoxynucleotide triphosphates (dNTPs) (Boehringer 
Mannheim, Indianapolis, IN) and digested with AseL The digested 
probe (566-bp) was separated by agarose gel electrophoresis, gel- 

15 purified and then used for hybridization. The completely digested 
genomic E. canis DNA was electrophoresed and transferred to a nylon 
membrane (Boehringer Mannheim, Indianapolis, IN) and hybridized at 
40°C for 16 hr with the ECa28-l gene DIG-labeled probe in DIG Easy 
Hyb buffer according to the manufacturer's protocol (Boehringer 

20 Mannheim, Indianapolis, IN). Bound probe was detected with a anti- 
DIG alkaline phosphatase-conjugated antibody and a luminescent 
substrate (Boehringer Mannheim, Indianapolis, IN) and exposed to 
BioMax scientific imaging film (Eastman Kodak, Rochester, NY). 

25 F,XAMPLE 9 

Sequence Analysis and Companion 

E chaffeensis p28 and C ruminantium map-1 DNA 
sequences were obtained from the National Center of Biotechnology 
30 Information (NCB1) (World Wide Web site at URL: 
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http://www.ncbi.nlm.nih.gov/Entrez). Nucleotide and deduced amino 
acid sequences, and protein and phylogenetic analyses were 
performed with LASERGENE software (DNASTAR, Inc., Madison, WI). 
Analysis of post-translational processing was performed by the 
method of McGeoch and von Heijne for signal sequence recognition 
using the PSORT program (McGeoch, 1985; von Heijne, 1986) (World 

Wide Web site at URL: PRIVATE HREF= "http://www.imcb.osaka- 

u.ac.jp/nakai/form.htnr, MACROBUTTON HtmlResAnchor 

http://www.imcb.osaka-u.ac.jp/nakai/form.htm). 

GenBank accession numbers for nucleic acid and amino 

acid sequences of the £ canis ECa28-l genes described in this study 

are: Jake, AF082744; Louisiana, AF082745; Oklahoma, AF082746; 

Demon, AF082747; DJ, AF082748; Fuzzy, AF082749; Florida, 

AF082750. 

Sequence analysis of ECa28-l from seven different strains 
of E. canis was performed with primers designed to amplify the entire 
gene. Analysis revealed the sequence of this gene was conserved 
among the isolates from North Carolina (four), Louisiana, Florida and 
Oklahoma. 

EXAMELE i» 

PCR Amplification Cloninp Sen nencinp and Expression of F.CalR- 1 

Alignment of nucleic acid sequences from £ chaff eensis 
p28 and Cowdria ruminantium map ] using the Jotun-Hein aligorithm 
produced a consensus sequence with regions of high homology 
(>90%). These homologous regions (nucleotides 313-332 and 823- 
843 of C. ruminantium map-1; 307-326 and 814-834 of £ chaffeensis 
p28) were targeted as primer annealing sites for PCR amplification. 
PCR amplification of the £ canis ECa28-l and £ chaffeensis P 28 gene 
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was accomplished with primers 793 and 1330, resulting in a 518-bp 
PCR product. The nucleic acid sequence of the £ canis PCR product 
was obtained by sequencing the product directly with primers 793 and 
1330. Analysis of the sequence revealed an open reading frame 
5 encoding a protein of 170 amino acids, and alignment of the 518-bp 
sequence obtained from PCR amplification of E canis with the DNA 
sequence of E. chaffeensis p28 gene revealed a similarity greater than 
70%, indicating that the genes were homologous. Adapter PCR with 
primers 394 and 793C was performed to determine the 5' and 3 f 

10 segments of the sequence of the entire gene. Primer 394 produced 
four PCR products (3-kb, 2-kb, 1-kb, and 0.8-kb), and the 0.8-bp 
product was sequenced bidirectionally using primers 394 and API. 
The deduced sequence overlapped with the 3 r end of the 518-bp 
product, extending the open reading frame 12-bp to a termination 

15 codon. An additional 625-bp of non-coding sequence at the 3' end of 
the ECa28-l gene was also sequenced. Primer 394C was used to 
amplify the 5* end of the ECa28-I gene with supplied primer API. 
Amplification with these primers resulted in three PCR products (3.3, 
3-kb, and 2-kb). The 2-kb fragment was sequenced unidirectionally 

20 with primer 793C. The sequence provided the putative start codon of 
the ECa28-l gene and completed the 834-bp open reading frame 
encoding a protein of 278 amino acids. An additional 144-bp of 
readable sequence in the 5' noncoding region of the ECa28 1 gene was 
generated. Primers EC280M-F and EC280M-R were designed from 

25 complementary non-coding regions adjacent to the ECa28-l gene. 

The PCR product amplified with these primers was 
sequenced directly with the same primers. The complete DNA 
sequence (SEQ ID NO. 1) for the E canis ECa28-I gene is shown in 
Figure 1. The ECa28-l PCR fragment amplified with these primers 

3D conlained the entire open reading frame and 17 additional amino 

33 
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acids from the 5' non-coding primer region. The gene was 
directionally subcloned into pThioHis expression vector, and E coli 
(BL2I) were transformed with this construct. The expressed ECa28-l- 
thioredoxin fusion protein was insoluble. The expressed protein had 
5 an additional 114 amino acids associated with the thioredoxin, 5 
amino acids for the enterokinase recognition site, and 32 amino acids 
from the multiple cloning site and 5' non-coding primer region at the 
N-terminus. Convalescent-phase antiserum from an E canis infected 
dog recognized the expressed recombinant fusion protein, but did not 
10 react with the thioredoxin control (Figure 2). 

EXAMPLE 11 

Sequence Homology 

15 The nucleic acid sequence of ECa28-l (834-bp) and the £ 

chaff eensis omp-1 family of genes including signal sequences (ECa28- 
l y omp-lA 9 B t C, A £> and F) were aligned using the Clustal method to 
examine homology between these genes (alignment not shown). 
Nucleic acid homology was equally conserved (68.9%) between ECa28- 

20 7, and E chaff eensis p28 and omp-lF. Other putative outer 
membrane protein genes in the E chaffeensis omp-1 family, omp-lD 
(68.2%), ompAE (66.7%), omp-\C (64.1%), Cowdria ruminantium 
map-1 (61.8%), £ canis 28-kDa protein 1 gene (60%) and 28-kDa 
protein 2 gene (partial) (59.5%) were also homologous to ECa28-L E 

25 chaffeensis omp-lB had the least nucleic acid homology (45.1%) with 
E.Ca28-L 

Alignment of the predicted amino acid sequences of 
ECa28-l (SEQ ID NO. 2) and E chaffeensis P28 revealed amino acid 
substitutions resulting in four variable regions (YR). Substitutions or 
30 fleletions in the amino acid sequence and the locations of variable 
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regions of ECa28-l and the E. chaffeensis OMP-1 family were identified 
(Figure 3). Amino acid comparison including the signal peptide 
revealed that ECa28-l shared the most homology with OMP-1F (68%) 
of the K chaffeensis OMP-1 family, followed by £ chaffeensis P28 
5 (65.5%), OMP-1E (65.1%), OMP-1D (62.9%), OMP-1C (62.9%), 
Cowdria ruminantium MAP-1 (59.4%), £ canis 28-kDa protein 1 
(55.6%) and 28-kDa protein 2 (partial) (53.6%), and OMP-1B 
(43.2%). The phylogenetic relationships based on amino acid 
sequences show that ECa28-l and C. ruminantium MAP-1, £ 
10 chaffeensis OMP-1 proteins, and £ cants 28-kDa proteins 1 and 2 
(partial) are related (Figure 4). 

F.YAMPT E 1? 

15 Predicted Surface Prohahility and Tminii nnrpflnivity 

Analysis of £ canis ECa28-l using hydropathy and 
hydrophilicity profiles predicted surface-exposed regions on ECa28-l 
(Figure 6). Eight major surface-exposed regions consisting of 3 to 9 
amino acids were identified on ECa28-l and were similar to the profile 

20 of surface-exposed regions on £ chaffeensis P28 (Figure 6). Five of 
the larger surface-exposed regions on ECa28-l were located in the N- 
terminal region of the protein. Surface-exposed hydrophilic regions 
were found in all four of the variable regions of ECa28-l. Ten T-cell 
motifs were predicted in the ECa28-l using the Rothbard-Taylor 
25 aligorithm (Rothbard and Taylor, 1988), and high antigenicity of the 
ECa28-l was predicted by the Jameson-Wolf antigenicity aligorithm 
(Figure 6) (Jameson and Wolf, 1988). Similarities in antigenicity and 
T-cell motifs were observed between ECa28-l and £ chaffeensis P28. 

30 
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EXAMPLE n 

Detection Of H omolopons Geno mic Cnpitx; of F.Cn?H-l On** 

Genomic Southern blot analysis of E. canis DNA completely 
digested independently with restriction enzymes Banll, EcoRV, Haell, 
Kpril, Spel, which do not have restriction endonuclease sites in the 
ECa28-I gene, and Asel, which has internal restriction endonuclease 
sites at nucleotides 34, 43 and 656, revealed the presence of at least 
three homologous ECa28-J gene copies (Figure 5). Although ECa28-l 
has internal Ase I internal restriction sites, the DIG-labeled probe used 
in the hybridization experiment targeted a region of the gene within a 
single DNA fragment generated by the Asel digestion of the gene. 
Digestion with A*el produced 3 bands (approximately 566-bp, 850 - 
bp, and 3-kb) that hybridized with the ECa28-l DNA probe indicating 
the presence of multiple genes homologous to ECa28-l in the genome. 
Digestion with EcoRV and Spel produced two bands that hybridized 
with the ECa28-l gene probe. 

FX A MPT F 14 

Identification of ?R-kDa Pmr^in r, Pn( » r nr,. t 

Specific primers designated ECaSA3-2 (5*-CTAGGATTA 
GGTTATAGTATAAGTT-3', SEQ ID No. 26) corresponding to regions 
within ECa28SA3 and primer 793C (SEQ ID No. 23) which anneals to a 
region with ECa28-J were used to amplify the intergenic region 
between gene SA3 and ECa28-J. The 800-bp product was sequenced 
with the same primers. DNA was amplified with a thermal cycling 
profile of 95°C for 2 min, and 30 cycles of 95 °G for 30 sec, 50°C for I 
min, 72°C for 1 min followed by a 72°C extension for 10 min and 4°C 
hold. 
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PCR Ampl ification of 28-kDn Protein denes and identification of the. 
Multiple dene I^cus 
5 In order to specifically amplify possible unknown genes 

downstream of ECa28SA2, primer 46f specific for ECa28SA2, and 
primer 1330 which targets a conserved region on the 3 r end of ECa28- 
1 gene were used for amplification. A 2-kb PCR product was amplified 
with these primers that contained 2 open reading frames. The first 

10 open reading frame contained the known region of gene, ECaSA2, and 
a previously unsequenced 3' portion of the gene. Downstream from 
ECaSA2 an additional non identical, but homologous 28-kDa protein 
gene was found, and designated ECa28SA3. The two known loci were 
joined by amplification with primer SA3-2 specific for the 3' end of 

15 ECa28SA3 gene was used in conjunction with a reverse primer 793C, 
which anneals at 5* end of ECa28-L An 800-bp PCR product was 
amplified which contained the 3' end of Eca28SA3 y the intergenic 
region between ECa28SA3 and ECa28-l (28NC3) and the 5' end of 
Eca28-L joining the previously separate loci (Figure 8). The 849-bp 

20 open reading frame of ECa28SA2 encodes a 283 amino acid protein, 
and ECa28SA3 has an 840-bp open reading frame encoding a 280 
amino acid protein. The intergenic noncoding region between 
ECa28SA3 and ECa28-l was 345-bp in length (Figures 7 and 8) 

25 KXAMPIF 16 

Nucleic and Amino Acid Homology 

The nucleic and amino acid sequences of all five E. canis 
28-kDa protein genes were aligned using the Clustal method to 
30 examine the homology between these genes. The nucleic acid 
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homology ranged from 58 to 75% and a similar amino acid homology 
of ranging from 67 to 72% was observed between the E. cahis 28-kDa 
protein gene members (Figure 9). 

5 EXAMPLE 17 



Transcriptio nal Promoter Regions 

The intergenic regions between the 28-kDa protein genes 

were analyzed for promoter sequences by comparison with consensus 
10 Escherichia coli promoter regions and a promoter from EL chaffeensis 

(Yu et a/:, 1997; McCiure, 1985). 

Putative promoter sequences including RBS, -10 and -35 

regions were identified in 4 intergenic sequences corresponding to 

genes ECa28SA2 y ECa28SA3, ECa28-l y and ECa28-2 (Figure 10). The 
15 upstream noncoding region of ECa28SAl is not known and was not 

analyzed. 

EXAMPLE 18 

20 N-Terminal Signal Sequence 

The amino acid sequence analysis revealed that entire 
E.canis ECa28-l has a deduced molecular mass of 30.5-kDa and the 
entire ECa28SA3 has a deduced molecular mass of 30.7-kDa. Both 
proteins have a predicted N-terminal signal peptide of 23 amino acids 

25 (MNCKJCILTTTAL^ SEQ ID No. 27), which is similar to that 

predicted for E chaffeensis P28 (MNYKKILTrSALISL^ SFS, SEQ ID 
NO. 28), and the OMP-1 protein family (Yu et aL, 1998; Ohashi et ai 7 
1998b). A preferred cleavage site for signal peptidases (SIS; Ser-X- 
Ser) (Oliver, 1985) is found at amino acids 21, 22, and 23 of ECa28-l. 

30 An additional putative cleavage site at amino acid positron 25 
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(MNCKKIUT^^ SEQ ID NO. 29) identical to the 

predicted cleavage site of £ chaff eensis P28 (SFS) was also present, 
and would result in a mature ECa28-l with a predicted molecular mass 
of 27.7-kDa. Signal cleavage site of the previously reported partial 
5 sequence of ECa28SA2 is predicted at amino acid 30. However, signal 
sequence analysis predicted that ECa28SAl had an uncleavable signal 
sequence. 
Summary 

Proteins of similar molecular mass have been identified 

10 and cloned from multiple rickettsial agents including K canis, E 
chaff eensis, and C rurninantium (Reddy et a/., 1998; Jongejan et aL, 
1993; Ohashi et ai y 1998). A single locus in Ehrlichia chaffeensis with 
6 homologous p28 genes, and 2 loci in £. canis, each containing some 
homologous 28-kDa protein genes have been previously described. 

15 The present invention demonstrated the cloning, 

expression and characterization of genes encoding a mature 28-kDa 
protein of E canis that are homologous to the omp-1 multiple gene 
family of E. chaffeensis and the C. rurninantium map-I gene. Two new 
28-kDa protein genes were identidfied, Eca28-J and ECa28SA3. 

20 Another E.canis 28-kDa protein gene, ECa28SA2 y partially sequenced 
previously (Reddy et a/., 1998), was sequenced completely in the 
present invention. Also disclosed is the identification and 
characterization of a single locus in E.canis containing all five E.canis 
28-kDa protein genes. 

25 The E.canis 28-kDa protein are homologous to 

E.chaffeensis OMP-1 family and the MAP-1 protein of C rumanintium. 
The most homologous £ canis 28-kDa proteins (ECa28SA3, ECa28-l 
and ECa28-2) are sequentially arranged in the locus. Homology of 
these proteins ranged from 67.5% to 72.3%. Divergence among these 

30 28-kDa proteins was 273% to 38.6%. E canis 28-kDa proteins 
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ECa28SAl and ECa28SA2 were the least homologous with homology 
ranging from 50.9% to 59.4% and divergence of 53.3 to 69.9%. 
Differences between the genes lies primarily in the four hypervariable 
regions and suggests that these regions are surface exposed and 
5 subject to selective pressure by the immune system. Conservation of 
ECa28-I among seven £ canis isolates has been reported (McBride el 
aL 9 1999), suggesting that £ cam's may be clonal in North America. 
Conversely, significant diversity of p28 among £ chaffeensis isolates 
has been reported (Yu el a/., 1998). 

10 All of the £ canis 28-kDa proteins appear to be post 

translationally processed from a 30-kD protein to a mature 28^kD 
protein. Recently, a signal sequence was identified on £ chaffeensis 
P28 (Yu el al y 1998), and N-lerminal amino acid sequencing has 
verified that the protein is post-translationally processed resulting in 

15 cleavage of the signal sequence to produce a mature protein (Oh as hi 
el aL 9 1998). The leader sequences of OMP-1F and OMP-1E have also 
been proposed as leader signal peptides (Ohashi el al. y 1998). Signal 
sequences identified on £ chaffeensis OMP-1F, OMP-1E and P28 are 
homologous to the leader sequence of £ canis 28-kDa protein. 

20 Promoter sequences for the p28 genes have not been determined 
experimentally, but putative promoter regions were identified by 
comparison with consensus sequences of the RBS, -10 and -35 
promoter regions of £ coli and other ehrlichiae (Yu el aL y 1997; 
McClure, 1985). Such promoter sequences would allow each gene to 

25 potentially be transcribed and translated, suggesting that these genes 
may be differentially expressed in the host. Persistence of infection in 
dogs may be related to differential expression of p28 genes resulting 
in antigenic changes in vivo, thus allowing the organism to evade the 
immune response. 
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The £ canis 28-kda protein genes were found to exhibit 
nucleic acid and amino acid sequence homology with the £ 
chaffeensis omp-1 gene family and C ruminantium map-/ gene. 
Previous studies have identified a 30-kDa protein of £ canis that 
5 reacts with convalescent phase antisera against £ chaffeensis, but was 
believed to be antigenically distinct (Rikihisa et aL y 1994). Findings 
based on comparison of amino acid substitutions in four variable 
regions of £ canis 28-kDa proteins support this possibility. Together 
these findings also suggest that the amino acids responsible for the 

10 antigenic differences between £ canis and £ chaffeensis P28 are 
located in these variable regions and are readily accessible to the 
immune system. It was reported that immunoreactive peptides were 
located in the variable regions of the 28-kDa proteins of C. 
ruminantium, £ chaffeensis and £ canis (Reddy et ai y 1998). Analysis 

15 of £ canis and £ chaffeensis P28 revealed that all of the variable 
regions have predicted surface-exposed amino acids. A study in dogs 
demonstrated lack of cross protection between £ canis and £ 
chaffeensis (Dawson and Ewing, 1992). This observation may be 
related to antigenic differences in the variable regions of P28 as well 

20 as in other immunologically important antigens of these ehrlichiai 
species. Another study found that convalescent phase human antisera 
from £ chaffeensis-infecied patients recognized 29/28-kDa protein(s) 
of £ chaffeensis and also reacted with homologous proteins of £ canis 
(Chen et al. y 1997). Homologous and crossreactive epitopes on the £ 

25 canis 28-kDa protein and £ chaffeensis P28 appear to be recognized 
by the immune system. 

£ canis 28-kDa proteins may be important 
immunoprotective antigens. Several reports have demonstrated that 
the 30-kDa antigen of £ canis exhibits strong immunoreactivity 

30 (Rikihisa et a/., 1994; Rikihisa et al y 1992). Antibodies in 
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convalescent phase antisera from humans and dogs have consistently 
reacted with proteins in this size range from £ chaffeensis and £ 
canis, suggesting that they may be important immunoprotective 
antigens (Rikihisa et aL, 1994; Chen et aL, 1994; Chen et aL, 1997). In 
5 addition, antibodies to 30, 24 and 21-kDa proteins developed early in 
the immune response to £ canis (Rikihisa et aL, 1994; Rikihisa et aL, 
1992), suggesting that these proteins may be especially important in 
the immune responses in the acute stage of disease. Recently, a family 
of homologous genes encoding outer membrane proteins with 
0 molecular masses of 28-kDa have been identified in £ chaffeensis, and 
mice immunized with recombinant £ chaffeensis P28 appeared to 
have developed immunity against homologous challenge (Ohashi et 
aL, 1998). The P28 of £ chaffeensis has been demonstrated to be 
present in the outer membrane, and immunoelectron microscopy has 
5 localized the P28 on the surface on the organism, and thus suggesting 
that it may serve as an adhesin (Ohashi et aL, 1998). It is likely that 
the 28-kDa proteins of £ canis identified in this study have the same 
location and possibly serve a similar function. 

Comparison of ECa28-l from different strains of £ canis 
revealed that the gene is apparently completely conserved. Studies 
involving £ chaffeensis have demonstrated immunologic and 
molecular evidence of diversity in the ECa28-l. Patients infected with 
£ chaffeensis have variable immunoreactivity to the 29/28-kDa 
proteins, suggesting that there is antigenic diversity (Chen et aL, 

1997) . Recently molecular evidence has been generated to support 
antigenic diversity in the P 28 gene from £ chaffeensis (Yu et aL, 

1998) . A comparison of five £ chaffeensis isolates revealed that two 
isolates (Sapulpa and St. Vincent) were 100% identical, but three 
others (Arkansas, Jax, 91HE17) were divergent by as much as 13.4% 
at the amino acid level. The conservation of ECa28-l suggests that £ 
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canis strains found in the United States may be genetically identical, 
and thus E. canis 28-kDa protein is an attractive vaccine candidate for 
canine ehrlichiosis in the United States. Further analysis of E. canis 
isolates outside the United States may provide information regarding 
the origin and evolution of E canis. Conservation of the 28-kDa 
protein makes it an important potential candidate for reliable 
serodiagnosis of canine ehrlichiosis. 

The role of multiple homologous genes is not known at 
this point; however, persistence of Ecanis infections in dogs could 

conceivably be related to antigenic variation due to variable 

expression of homologous 28-kDa protein genes, thus enabling E 

canis to evade immune surveillance. Variation of msp-3 genes in A. 

mdrginale is partially responsible for variation in the MSP-3 protein, 

resulting in persistent infections (Alleman et aL, 1997). Studies to 

examine 28-kDa protein gene expression by E canis in acutely and 

chronically infected dogs would provide insight into the role of the 

28-kDa protein gene family in persistence of infection. 
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Any patents or publications mentioned in this specification 
are indicative of the levels of those skilled in the art to which the 
invention pertains. These patents and publications are herein 
incorporated by reference to the same extent as if each individual 
publication was individually indicated to be incorporated by 
reference. 

One skilled in the art will readily appreciate that the 
present invention is well adapted to carry out the objects and obtain 
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the ends and advantages mentioned, as well as those inherent therein. 
The present examples along with the methods, procedures, 
treatments, molecules, and specific compounds described herein are 
presently representative of preferred embodiments, are exemplary, 
5 and are not intended as limitations on the scope of the invention. 
Changes therein and other uses will occur to those skilled in the art 
which are encompassed within the spirit of the invention as defined by 
the scope of the claims. 
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1. DNA sequences encoding a 30-kilodalton protein of 
Ehrlichia canis, wherein said protein is immunoreactive with anti- 
5 Ehrlichia canis serum. 



2. The DNA sequences of claim 1, wherein said protein 
has an amino acid sequence selected from the group consisting of SEQ 

1 0 ID No. 2, SEQ ID No. 4 and SEQ ID No. 6. 

3. The DNA sequences of claim 2, wherein said protein 
has an N-terminal signal sequence. 

15 

4. The DNA sequences of claim 3, wherein said protein 
is post-translationally modified to a 28-kiIodalton protein. 

20 

5. The DNA sequences of claim 1, wherein said DNA has 
a sequence selected from the group consisting of SEQ ID No. 1, SEQ ID 
No. 3 and SEQ ID No. 5. 

25 

6. The DNA sequences of claim 1, wherein said DNA is 
contained in a single locus of Ehrlichia canis. 
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7. The DNA sequences of claim 6, wherein said locus is 
a multigene locus of 5.592 kb in length. 

5 8. The DNA sequences of claim 7, wherein said locus 

encoding homologous 28-kilodaltori proteins of Ehrlichia canis. 

- 9. The DNA sequences of claim 8, wherein said 

10 homologous 28-kilodalton proteins of Ehrlichia canis are selected 
from the group consisting of ECa28SAl, ECa28SA2, ECa28SA3, ECa28-l 
and ECa28-2. 

10. A vector comprising the DNA sequences of claim 1. 

15 

11. The vector of claim 10, wherein said vector is an 
expression vector capable of expressing a peptide or polypeptide 
encoded by the sequence selected from the group consisting of SEQ ID 

20 No. I, SEQ ID No. 3 and SEQ ID No. 5 when said expression vector is 
introduced into a cell. 

12. A recombinant protein comprising the amino acid 
25 sequence selected from the group consisting of SEQ ID No. 2, SEQ ID 

No. 4 and SEQ ID No. 6. 

13. The recombinant protein of claim 12, wherein said 
30 amino acid sequence is encoded by a nucleic acid segment comprising 

4 7 
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a sequence selected from the group consisting of SEQ ID No. 1, SEQ ID 
No. 3 and SEQ ID No. 5. 

5 14. A host cell comprising the nucleic acid segment 

selected from the group consisting of SEQ ID No. 1, SEQ ID No. 3 and 
SEQ ID No. 5. 

10 15. A method of producing the recombinant protein of 

claim 12, comprising the steps of: 

obtaining a vector that comprises an expression region 
comprising a sequence encoding the amino acid sequence selected 
from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and SEQ ID 
15 No. 6 operatively linked to a promoter; 

transfecting said vector into a cell; and 

culturing said cell under conditions effective for 
expression of said expression region. 

20 

16. An antibody immunoreactive with an amino acid 
sequence selected from the group consisting of SEQ ID No. 2, SEQ ID 
No. 4 and SEQ ID No. 6. 

25 17. A method of inhibiting Ehrlichia can is infection in a 

subject comprising the steps of: 

identifying a subject suspected of being exposed to or 
infected with Ehrlichia canis; and 
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administering a composition comprising a 28-kDa antigen 
of Ehrlichia canis in an amount effective to inhibit an Ehrlichia canis 
infection. 

5 

18. The method of claim 17, wherein said 28-kDa 
antigen is a recombinant protein comprising an amino acid sequence 
selected from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and 
SEQ ID No. 6. 

0 

19. The method of claim 18, wherein said recombinant 
protein is encoded by a gene comprising a sequence selected from the 
group consisting of SEQ ID No. I, SEQ ID No. 3 and SEQ ID No. 5. 

5 20. The method of claim 18, wherein said recombinant 

protein is dispersed in a pharmaceutically acceptable carrier. 
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1 ATTITATTTATrACCAATCTiTATATAATATAT^ 60 

61 TITTATACCTAATAraTATATTCTG^ 120 

121 TTATirTCACTATTTTAGCTGTMTATCMrrGCAAAAAMTTCTTA 180 

M N C K K I L I T T A f 

161 AATATCAITAATGTACTCTATTCCAAGCATATCTTITTCTC 240 
I S f H Y S — I P S T ft F S O T I Q D G N 

241 CATGGGTGGTMCTTCTATATTAGTGGAAAGTATGTACCAACT 300 
MQGNFYI SGKYVPSVSHF GS 

301 CITCTCAGCTAAAGAAGAAAGCAAATCAACTGTTGGAGTrriTGGATTAAAACATGAT^ 360 
FSA KEES RSTVGVFGLK HDW 

36 i. G^ATGGAAGTCCMTACTTMGAATAMCACGGTGACTrTACTGTrCCAAACTATTCG^ 420 
DGSPILK.MKHADFT VPNYSF 

421 CAGATACGAGMCAATCCATITCTAGGGTTTGCAGGAGCTATCGGTT^ 480 
RYEHHPFLOFAO.A-I GYSMGG 

481 CCCAAGAATAGAATTCGAAATATCTTATGMGCATrCGACGTAAAAAGTCCTAATATCAA 540 
P RIEFE I SYEAF DVKSPNI N 

541 TTATCAAAATGACGCGCACAGGTACTGCGCTCTATCTCATCACACATCGGCAGCCATGGA 600 
YQHDA HRYCALSHHTSAAME 

601 AGCTGATAAATTTGTc i lei 1 AAAAAACGAAGGGTTAATTGACATATCACTTGCAATAAA 660 
ADRFV FLRNEGLIDISLAIN 

661 TGCATGTTATGATATAATAAATGACAAAGTACCTG1 n L'lL'L'l 1ATATATGCGCAGGTAT 720 
ACYDI INDKVPVSPYICAGI 

721 TGGTACTGATTTGATITCTATGTTTGMGCTACAA 780 
GTDL I SMFEATSPKI SY QG R 

781 ACTGGGCATTAGTTACTCrATTAATCCGGAAACCTCTGTTrrCATGGGTGGGCATrrCCA 840 
L G ISYS I NPETSVF IGGHFH 

841 CAGGATCATAGGTMTGAGTTTAGAGATATTCCTGCAATAGTACCTAGTAACTCAACTAC 900 
uilGNEFRDl PAIVPSNSTT 

901 AATAAGTGGACCACAATITGCAACAGTAACACTAAArc 960 
I SGPQFATVTLNV CHFGLEL 

961 IGGAGGAAGATTTAACTrCTAATTTTATTGTrGCCACATATTAA 1020 
G G R F N F (SEQ ID NO: 2) 

1021 "1111 i AVrATlTJCTACATACAAAAAMGAAAAATAGTGGC AAAAGAATGTAGCAATAAGA 1000 

1081 GGGGGGGGGGGGACCAAATITATCrTTCrrATGCTrcCC^ 1140 

1141 CTTAAACMCAGAAGCTAATATCCTCACGGAAAA 1200 

1201 CCAATCITATATAATA TftTTAAAT ITCrKrrrACAA 1260 

1261 TATATATTCTGACTTGC 111 1 CI 1 CTGCACTTCTACTATTITrAATTTA'nTGTCACTAT 1320 

1321 TAGGTrATMTMWATXSAATrGCMAAAGArrriTCATAGCAAGTGCATroATATCACTAA 1380 

1381 TG TC 1 1 1 (J 1 1 ACCTAGCGTATG 1 1 1 1TCTGAATCAATACATGAAQATAATATAAATGGTA 1440 

1441 ACTITrACATTAGTGCAAAGTATATGCCAAGTGCCTCACACTTTGGCGTATTT^ 1500 

1501 AAGAAGAGAAAMCACAACAACTGGAGTTTTCGGATrAAAACAAGATTGGGACGGAGCAA 1560 

1561 CACTAAAGGATGCAAGCWGCAGCCACACAOTAGACCCAAGTACAATG 1607 

(SEQ ID NO: 1) 
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Eca28SA2 

ATGAATtGTAAAAAAGTTTTCACAATAAGTGCATTGATATCATCCATATACTTCCTACCT 6 0 
MNCKK V FTISALISSIYFLP 

AATGTCTCATACTCTAACCCAGTATATGGTAACAGTATGTATGGTAATTTTTACATATCA 120 
NV CYSNPVY GNSMYGNFYIS 

GGAAAGTACATGCCAAGTGTTCCTCATTTTGGAATTTTTTC AGCTGAAGAAGAGAAAAAA 180 
G KYM PSVPH F.G I FSAEEEKK 

AAG AC AACTGTAGTATATGG CTTAAAAGAAAACTGGGC AGG AG ATGC AAT AT CTAG TCAA 240 
KTTVVYGLKENWAGDAISSQ 

AGTCCAGATGATAATTTTACC^TTCGAAATTACTCATTCAAGTATGCAAGC'.J'.C^ A-7TTT 300 
S PDDNFTI RN YS FK YASN K F 

TTAGGGTTTGCAGTAGCTATTGGTTACTCGATAGGC AGTCCAAG AATAGAAGTTGAG ATG 360 
LGFAVAIGYS IGSPRI EV EM 

TCTTATGAAGCATTTGATGTGAAAAATCGAGGTGATAATTACAAAAACGGTGCTTACAGG 420 
SYEAFDV KNPGDNYKNGAYR. 

TATTGTGCTTTATCTCATCAAGATGATGCGGATGATGACATGACTAGTGCAACTGACAAA 480 
YCALSHQDDA DDDMTSATDK 

TTTGTATATTTAATTAATGAAGGATTACTTAACATATCATTTATGACAAACATATGTTAT 54 O 
FVYLINEGLLNISFMTNI C Y 

GAAACAGCAAGCAAAAATATACCTCTCTCTCCTTACATATGTGCAGGTATTGGTACTGAT 600 
ETA SKNIP LSPYICAG IGTD 

TTAATTCACATGTTTGAAACTACACATCCTAAAATTTCTTATCAAGGAAAGCTAGGGTTG 660 
L IHM- FETTHP KI SYQGKLGL 

GCCTACTTCGTAAGTGCAGAGTCTTCGGTT^^ ATA 72 0 

AYFVSAE SSVS FGIYFHK I I 

AATAATAAGTTTAAAAATGTTCCAGCCATGGTACCTATTAACTCAGACGAGATAGTAGGA 780 
NNKFKNVPAMVPIHSDEIVG 

CC^CAGTTTGCAACAGTAACATTAAATGTATGCTACTTTGGATTAGAACTTGGATGTAGG 84 O 

PQF..ATVTLHVCYP6LBLG C R 
* (SEQ ID NO: 3) 

TTCAACrrCTAATTTCGTGGTACACATATCACGAAGCTAAAATTGTTTTTTTATCTCTGC 900 
F N F * (SEQ ID NO: 4) 

TGTATACAAGAGAAAAAATAGTAGTGAAAATTACCTAACAATATGACAGTACAAGTTTAC 960 
CAAGCTrATTCTCACAAAACTTCTTGTGTCTTTTATCTCTTTACAATGAAATGTACACTT 1020 

FIG. 7-1 
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AGCTTC^CTACTGTAGAGTGTGTTTATCAATGCTTTGTTTATTAATACTCTACATAATAT 1080 
GTTAAATTTTTCTTACAAAACTCACTAGTAATTTATACTAGAATATATATTCTGACTTGT 114 0 
ECa28SA3 tSEQ ID NO: 31) 

ATTTGCTTTATACITCCACrATTGTTAATTTATTTTCACTATTTTAGGTGTAA'^TGAAT 1200 

M N 

TGCAAAAAAATTCTTATAACAACTGCATTAATGTCATTAATGTACTATGCTCCAAGCATA 1260 
CKKILITTALM SLMYYAPS I 

TCTTTTTCTGATACTATACAAGACGATAACACTGGTAGCTTCTACATCAGTGGAAAATAT 1320 
SFSDTIQDDNTGSFYISGKY 

GTACCAAGTGTTTCACATTTTGGTGTTTTCTCAGCTAAAGAAGAAAGAAACTCAACTGTT 1380 
VP SVSHFG VFSAKEERNSTV 

GGAGTTTTTGGATTAAAACATGATTGGAATGGAGGTACAATATCTAACTCTTCTCCAGAA 1440 
GVFGLKHDWMGGTI SNSS P E 

AATATATTCACAGTTCAAAATTATTCGTTTAAATACGAAAACAACCCATTCTTAGGGTTT 1500 
N I FTVQNYS FKYENNPFLG F. 

GCAGGAGCTATTGGTTATTCAATGGGTGGCCCAAGAATAGAACTTGAAGTTCTGTACGAG 1 r 5 0 
AGAIGY S MGGPRIELEVLYE 

ACATTCGATGTGAAAAATCAGAACAATAATTATAAGAACGGCGCACACAGATACTGTGCT 1620 
TFDVKNQNNNYKNGAHRYCA 

TTATCTCATCATAGTTCAGCAACAAGCATGTCCTCCGCAAGTAACAAATTTGTTTTCTTA 1680 
L S HH S S ATS M S S AS N K FV F L 

AAAAATGAAGGGTTAATTGACTTATCATTTATGATAAATGCATGCTATGACATAATAATT 1740 
KNEGLIDLSFMINACYDI II 

GAAGGAATGCCTITTTCACCTTATATTTGTGCAGGTGTTGGTACTGATGTTGTTTCCATG 1800 
EGMPFSPYICAGVGTD VVSM 

TTTGAAGCTATAAATCCTAAAATTTCTTACCAAGGAAAACTAGGATTAGGTTATAGTATA 1860 
FEA INPKISYQGKLGL GYC I 

AGTTCAGAAGCCTCTGTTTTTATCr^TGGACACTTTCACAGAGTCATAGGTAA^ 1920 
S5BASVFIGGHFHRVIGNEF 

AGAGACATCCCTGCTATGGTTCCTAGTGGATCAAATCTTCCAGAAAACCAATTTGCAATA 1 S 8 0 
RDIPAMVPSGSNLPEHQFAI 

(SEQ ID NO^ 5> 

GTAACACTAAATGTGTGTCACTTTGGCATAGAACTTGGAGGAAGATTTAACTTCTGA 203 1 
VTL NVCHFGI -B-LGGRFNF * 

FIG.7-2 <»0 =»».«> 
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SEQUENCE LISTING 

<110> Walker, David K. 

McBride, Jere W. 
Yu, Xue-Jie 

<120> Homologous 28-Kilodalton Iirununodominant Protein 

Genes of Ehrlichia canis and Uses Thereof 

<130> D6152PCT 

<141> 1999-11-30 

<150> 09/261,358 

<151> 1999-03-03 

<160> 33 

<210> 1 

<211> 1607 

<212> DNA 

<213> Ehrlichia canis 

<220> 

<223> nucleic acid sequence of ECa28-l 



<400> 


1 










attttattta 


ctaccaatcc 


tatataatat 


attaaatctc 


tcttacaaaa 


50 


atctctaatg 


ttttatacct 


aatatatata 


ttctggcttg 


tatctacttt 


100 


gcacttccac 


tattgttaat 


ttattttcac 


tattttaggt 


gtaatatgaa 


150 


ttgcaaaaaa 


attcttataa 


caactgcatt 


aatatcacta 


atgtactcta 


200 


ttccaagcat 


atctctttct 


gatactatac 


aagatggtaa 


catgggtggt 


250 


aacttctata 


ctagcggaaa 


gtatgcacca 


agcgtctcac 


attttggtag 


300 


cttctcagct 


aaagaagaaa 


gcaaatcaac 


tgctggagtt 


tttggattaa 


350 


aacatgactg 


ggatggaagt 


ccaataccta 


agaataaaca 


cgctgacttt 


400 


actgttccaa 


actattcgtt 


cagatacgag 


aacaatccat 


ttctagggtt 


450 


tgcaggagct 


atcggttact 


caacgggtgg 


cccaagaata 


gaattcgaaa 


500 


tatcttacga 


agcattcgac 


gtaaaaagtc 


ctaatatcaa 


ttatcaaaat 


550 


gacgcgcaca 


ggtactgcgc 


tctatctcat 


cacacatcgg 


cagccacgga 


600 


agctgataaa 


tttgccttct 


taaaaaacga 


agggttaatt 


gacatatcac 


650 


ttgcaataaa 


tgcatgttat 


gatataataa 


atgacaaagt 


acctgtttct 


700 


ccttatatat 


gcgcaggtat 


tggtactgat 


ttgatttcta 


tgtttgaagc 


750 


tacaagtcct 


aaaacttcct 


accaaggaaa 


actgggcatt 


agttactcta 


800 


ttaatccgga 


aacctctgtt 


ttcatcggtg 


ggcatttcca 


caggatcata 


850 


ggtaatgagt 


ttagagatat 


tcctgcaata 


gtacctagta 


actcaactac 


900 




ccacaattrtg 


caacagtaac 


actraaatgrxj 


rgtcaccrtg 


950 
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gtttagaact 


tggaggaaga 


tttaacttct 


aattttattg 


ttgecacata 


1000. 


ttaaaaatga 


tctaaacttg 


tttttawtat 


tgetacatae 


aaaaaaagaa 


1050 


aaatagtggc 


aaaagaatgt 


ageaataaga gggggggggg 


ggaccaaatt 


1100 


tatcttctat 


gcttcccaag 


ttttttcycg ctatttatga 


cttaaacaac 


1150 


agaaggtaat 


a tec tea egg 


aaaacttatc 


ttcaaatatt 


ttatttatta 


1200 


ccaatcttat 


ataatatatt 


aaatttctct 


racaaaaatc 


actagtattt 


1250 


tataccaaaa 


tatatattct 


gaettgettt 


tcttctgcac 


ttctactatt 


1300 


tttaatttat 


ttgtcactat 


taggttataa 


taawatgaat 


tgemaaagat 


1350 


ttttcatagc 


aayuyCaULy 


atatcactaa 


tgtctttctt 


r5 c c t~ a a c ct t a 


1400 


tctttttctg 


aatcaataca 


tgaagataat 


ataaatggta 


acttttacat 


1450 


tagtgcaaag 


tatatgecaa 


gtgcctcaca ctttggcgta 


ttttcagtta 


1500 


cLciy aciy ay aa 




actggagttt 


teggattaaa 


acaaga t tgg 


1550 


gacggagcaa 


cactaaagga 


tgcaagcwgc 


agccacacaw 


tagacccaag 


1600 


tacaatg 










1607 


<21G> 


2 










<211> 


278 










<212> 


PRT 











<213> Ehrlichia canis 

<220> 

<223> amino acid sequence of ECa28-l protein 

<400> 2 

Met Asn Cys Lys Lys lie Leu lie Thr l:.r Ala Leu lie Ser Leu 

5 10 15 

Met Tyr Ser lie Pro Ser lie Ser Phe Ser Asp Thr lie Gin Asp 

20 25 30 

Gly Asn Met Gly Gly Asn Phe Tyr lie Ser Gly Lys Tyr Val Pro 

35 40 45 

Ser Val Ser His Phe Gly Ser Phe Ser Ala Lys Glu Glu Ser Lys 

50 55 60 

Ser Thr Val Gly Val Phe Gly Leu Lys His Asp Trp Asp Gly Ser 

65 70 75 

Pro lie Leu Lys Asn Lys His Ala Asp Phe Thr Val Pro Asn Tyr 

80 85 90 

Ser Phe Arg Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala 

95 100 105 

lie Gly Tyr Ser Met Gly Gly Pro Arg lie Glu Phe Glu He Ser 

110 115 ! 120 



SEQ2/24 



WO 00/32745 



PCT/US99/28075 



Tyr Glu Ala Phe Asp Val Lys Ser Pro Asn lie Asn Tyr Gin Asn 

125 130 135 

Asp Ala His Arg Tyr Cys Ala Leu Ser His His Thr Ser Ala Ala 

140 ' -.145 150 

Met Glu Ala Asp Lys Phe Val Phe Leu Lys Asn Glu Gly Leu lie 

155 160 165 

Asp He Ser Leu Ala He Asn Ala Cys Tyr Asp He He Asn Asp 

170 175 180 

Lys Val Pro Val Ser Pro Tyr He Cys Ala Gly He Gly Thr Asp 

185 190 195 

Leu He Ser Met Phe Glu Ala Thr Ser Pro Lys He Ser Tyr Gin 

200 * 205 210 

Gly Lys Leu Gly He Ser Tyr Ser He Asn Pro Glu Thr Ser Val 

215 220 225 

Phe He Gly Gly His Phe His Arg He He Gly Asn Glu Phe Arg 

230 235 240 

Asp lie Pro Ala He Val Pro Ser Asn Ser Thr Thr He Ser Gly 

245 250 255 

Pro Gin Phe Ala Thr Val Thr Leu Asn Val Cys His Phe Gly Leu 

260 265 270 

Glu Leu Gly Gly Arg Phe Asn Phe 

275 



<210> 3 

<211> 849 

<212> DNA 

<213> Ehrlichia canis 

<220> 

<221> mat_peptide 

<223> nucleic acid sequence of ECa28SA2 

<400> 3 

atgaattgta aaaaagtttt cacaataagt gcattgatat catccatata 50 
cttcctacct aatgtctcat actctaaccc agtatatggt aacagtatgt 100 
atggtaattt ttacatatca ggaaagtaca tgccaagtgt tcctcatttt 150 
ggaatttttt cagctgaaga agagaaaaaa aagacaactg tagtatatgg 200 
cttaaaagaa aactgggcag gagatgcaat atctagtcaa agtccagatg 250 
ataattttac cattcgaaat tactcattca agtatgcaag caacaagttt 300 



SEQ 3/24 



W ° 0002745 PCT/US99/28075 

ttagggtttg cagtagctat tggctactcg ataggcagtc caagaacaga 350 
agttgagatg tcttatgaag catctgatgt gaaaaatcca ggtgataact 400 
acaaaaacgg tgcttacagg tattgtgctt tatctcatca agatgacgcg 450 
gatgatgaca tgactagtgc aactgaeaaa tttgtatatt taattaatga 500 
aggattactt aacatatcat ttatgacaaa catatgttat gaaacagcaa 550 
gcaaaaatat acctctctct ccttacatat gtgcaggtat tggtactgat 600 
ttaattcaca tgtttgaaac tacacatcct aaaatttctt atcaaggaaa 650 
gctagggttg gcctacttcg taagtgcaga gtcttcggtt tcttttggta 700 
tatattttca taaaattata aataataagt ttaaaaatgt tccagccatg 750 
gtacctatta actcagacga gatagtagga ccacagtttg caacagtaac 800 
attaaatgta tgctactttg gattagaact tggatgtagg ttcaacttc 849 

<210> 4 

<211> 283 

<212> PRT 

<213> Ehrlichia canis 

<220> 

<223> amino acid sequence of ECa28SA2 protein 

<400> 4 

Met Asn Cys Lys Lys Val Phe Thr lie Ser Ala Leu He Ser Ser 

5 10 15 

He Tyr Phe Leu Pro Asn Val Ser Tyr Ser Asn Pro Val Tyr Gly 

20 25 30 

Asn Ser Met Tyr Gly Asn Phe Tyr He Ser Gly Lys. Tyr Met Pro 

35 40 45 

Ser Val Pro His Phe Gly lie Phe Ser Ala Glu Glu Glu Lys Lys 

50 55 60 

Lys Thr Thr Val Val Tyr Gly Leu Lys Glu Asn Trp Ala Gly Asp 

65 70 75 

Ala He Ser Ser Gin Ser Pro Asp Asp Asn Phe Thr He Arg Asn 

80 85 90 

Tyr Ser Phe Lys Tyr Ala Ser Asn Lys Phe Leu Gly Phe Ala Val 

95 100 105 

Ala He Gly Tyr Ser He Gly Ser Pro Arg He Glu Val Glu Met 

110 115 120 

Ser Tyr Glu Ala Phe Asp Val Lys Asn Pro Gly Asp Asn Tyr Lys 

125 130 135 



SEQ4/24 



W ° 00,32745 PCT/US99/28075 

Asn Gly Ala Tyr Arg Tyr Cys Ala Leu Ser His Gin Asp Asp Ala 
140 145 150 

Asp Asp Asp Met Thr Ser Ala Thr Asp Lys Phe Val Tyr Leu lie 
155 160 165 

Asn Glu Gly Leu Leu Asn lie Ser Phe Met Thr Asn lie Cys Tyr 

170 175 180 

Glu Thr Ala Ser Lys Asn lie Pro Leu Ser Pro Tyr lie Cys Ala 

185 190 195 

Gly lie Gly Thr Asp Leu lie His Met Phe Glu Thr Thr His Pro 

200 205 210 

Lys lie Ser Tyr Gin Gly Lys Leu Gly Leu Ala Tyr Phe Val Ser 

215 220 225 

Ala Glu Ser Ser Val Ser Phe Gly lie Tyr Phe His Lys lie lie 

230 235 240 

Asn Asn Lys Phe Lys Asn Val Pro Ala Met Val Pro He Asn Ser 

245 250 255 

Asp Glu He Val Gly Pro Gin Phe Ala Thr Val Thr Leu Asn Val 

260 265 270 

Cys Tyr Phe Gly Leu Glu Leu Gly Cys Arg Phe Asn Phe 

275 280 

<210> 5 

<211> 840 

<212> DNA 

<213> Ehrlichia canis 

<220> 

<221> mat_peptide 

<223> nucleic acid sequence of ECa28SA3 

<400> 5 



atgaattgca 
ctatgctcca 
gtagcttcta 
gttttctcag 
aaaacatgat 
tattcacagt 
gggtttgcag 
tgaagttctg 



aaaaaattct 
agcatatctt 
catcagtgga 
ctaaagaaga 
tggaatggag 
tcaaaattat 
gagctattgg 
tacgagacat 



tataacaact 
tttctgatac 
aaatatgtac 
aagaaactca 
gtacaatatc 
tcgtttaaat 
ttattcaatg 
tcgatgtgaa 



gcattaatgt: 
tatacaagac 
caagtgtttc 
actgttggag 
taactcttct 
acgaaaacaa 
ggtggcccaa 
aaatcagaac 



cattaatgta 50 
gataacactg 100 
acattttggt 150 
tttttggatt 200 
ccagaaaata 250 
cccattctta 300 
gaatagaact 350 
aataattata 400 



SEQ 5/24 



WO 0002745 PCT/US99/28075 

agaacggcgc acacagatac tgtgctrtat ctcatcatag ttcagcaaca 450 
agcatgtcct ccgcaagtaa caaatttgtt ttcttaaaaa atgaagggtt 500 
aattgactta tcatttatga taaatgcatg ctatgacata ataattgaag 550 
gaatgccttt ttcaccttat atttgtgcag gtgttggtac tgatgttgtt 600 
tccatgtttg aagctataaa tcctaaaatt tcttaccaag gaaaactagg 650 
attaggttat agtataagtt cagaagcctc tgtttttatc ggtggacact 700 
ttcacagagt cataggtaat gaatttagag acatccctgc tatggttcct 750 
agtggatcaa atcttccaga aaaccaattt gcaatagcaa cactaaatgt 800 
gtgtcacttt ggcatagaac ttggaggaag atttaacttc 840 



<210> 6 

<211> 280 

<212> PRT 

<213> Ehrlichia canis 

<220> 

<223> amino acid sequence of ECa28SA3* protein 

<400> 6 

Met Asn Cys Lys Lys lie Leu lie Thr Thr Ala Leu Met Ser Leu 

5 10 15 

Met Tyr Tyr Ala Pro Ser lie Ser Phe Ser Asp Thr lie Gin Asp 

20 25 30 

Asp Asn Thr Gly Ser Phe Tyr lie Ser Gly Lys Tyr Val Pro Ser 

35 40 45 

Val Ser His Phe Gly Val Phe Ser Ala Lys Glu Glu Arg Asn Ser 

50 55 60 

Thr Val Gly Val Phe Gly Leu Lys His Asp Trp Asn Gly Gly Thr 

65 70 75 

lie Ser Asn Ser Ser Pro Glu Asn lie Phe Thr Val Gin Asn Tyr 

80 85 90 

Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala 

95 100 105 

He Gly Tyr Ser Met Gly Gly Pro Arg He Glu Leu Glu Val Leu 

110 115 120 

Tyr Glu Thr Phe Asp Val Lys Asn Gin Asn Asn Asn Tyr Lys Asn 

125 130 135 

Gly Ala His Arg Tyr Cys Ala Leu Ser His His Ser Ser Ala Thr 

140 145 150 



SEQ6/24 



WO 00/32745 PCT/US99/2S075 

Ser Met Ser Ser Ala Ser Asn Lys Phe Val Phe Leu Lys Asn Glu 

155 160 165 

Gly Leu lie Asp Leu Ser Phe Met lie Asn Ala Cys Tyr Asp lie 

170 175 180 

lie lie Glu Gly Met Pro Phe Ser Pro Tyr lie Cys Ala Gly Val 

185 190 195 

Gly Thr Asp Val Val Ser Met Phe Glu Ala lie Asn Pro Lys lie 

200 205 210 

Ser Tyr Gin Gly Lys Leu Gly Leu Gly Tyr Ser lie Ser Ser Glu 

215 220 225 

Ala Ser Val Phe He Gly Gly His Phe His Arg Val He Gly Asn 

230 235 240 

Glu Phe Arg Asp He Pro Ala Met Val Pro Ser Gly Ser Asn Leu 

245 250 255 

Pro Glu Asn Gin Phe Ala He Val Thr Leu Asn Val Cys His Phe 

260 265 270 

Gly He Glu Leu Gly Gly Arg Phe Asn Phe 

275 280 

<210> 7 

<211> 133 

<212> PRT 

<213> Ehrlichia canis 

<220> 

<223> partial amino acid sequence of ECa28SA2 protein 

<400> 7 

Met Asn Cys Lys Lys Val Phe Thr He Ser Ala Leu lie Ser Ser 

5 10 15 

He Tyr Phe Leu Pro Asn Val Ser Tyr Ser Asn Pro Val Tyr Gly 

20 25 30 

Asn Ser Met Tyr Gly Asn Phe Tyr lie Ser Gly Lys Tyr Met Pro 

35 40 45 

Ser Val Pro His Phe Gly lie Phe Ser Ala Glu Glu Glu Lys Lys 

50 55 60 

Lys Thr Thr Val Val Tyr Gly Leu Lys Glu Asn Trp Ala Gly Asp 

65 70 75 

Ala He Ser Ser Gin Ser Pro Asp Asp Asn Phe Thr He Arg Asn 

80 85 90 



SEQ7/24 



WO 00/32745 PCT/US99/28075 

Tyr Ser Phe Lys Tyr Ala Ser Asn Lys Phe Leu Gly ?he Ala Val 

95 100 105 

Ala lie Gly Tyr Ser lie Gly Ser Pro Arg lie Glu Val Glu Met 

110 115 120 

Ser Tyr Glu Ala Phe Asp Val Lys Asn Gin Gly Asn Asn 
125 130 

<210> 8 

<211> 287 

<212> PRT 

<213> Ehrlichia canis . 

<220> 

<223> amino acid sequence of ECa28SAl protien 
<400> 8 

Met Lys Tyr Lys Lys Thr Phe Thr Val Thr Ala Leu Val Leu Leu 

5 10 15 
Thr Ser Phe Thr His Phe lie Pro Phe Tyr Ser Pro Ala Arg Ala 

20 25 30 

Ser Thr lie His Asn Phe Tyr lie Ser Gly Lys Tyr Met Pro Thr 

35 40 45 

Ala Ser His Phe Gly lie Phe Ser Ala Lys Glu Glu Gin Ser Phe 

50 55 60 

Thr Lys Val Leu Val Gly Leu Asp Gin Arg Leu Ser His Asn lie 

65 70 75 

lie Asn Asn Asn Asp Thr Ala Lys Ser Leu Lys Val Gin Asn Tyr 

80 85 90 

Ser Phe Lys Tyr Lys Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala 

95 100 105 

lie Gly Tyr Ser lie Gly Asn Ser Arg He Glu Leu Glu Val Ser 

110 115 120 

His Glu He Phe Asp Thr Lys Asn Pro Gly Asn Asn Tyr Leu Asn 

125 130 135 

Asp Ser His Lys Tyr Cys Ala Leu Ser His Gly Ser His He Cys 

140 145 150 

Ser Asp Gly Asn Ser Gly Asp Trp Tyr Thr Ala Lys Thr Asp Lys 

155 160 165 

Phe Val Leu Leu Lys Asn Glu Gly Leu Leu Asp Val Ser Phe Met 

170 175 180 



SEQ8/24 



WO 00/32745 PCT/US9 9/28075 

Leu Asn Ala Cys Tyr Asp lie Thr Thr Glu Lys Met Pro Phe Ser 

185 190 195 

Pro Tyr He Cys Ala Gly lie Giy Thr Asp Leu lie Ser Met Phe 

200 205 210 

Glu Thr Thr Gin Asn Lys He Ser Tyr Gin Gly Lys Leu Gly Leu 

215 220 225 

Asn Tyr Thr He Asn Ser Arg Val Ser Val Phe Ala Gly Gly His 

230 235 240 

Phe His Lys Val He Gly Asn Glu Phe Lys Gly He Pro Thr Leu 

245 250 255 

Leu Pro Asp Gly Ser Asn He Lys Val Gin Gin Ser Ala Thr Val 

260 265 270 

Thr Leu Asp Val Cys His Phe Gly Leu Glu He Gly Ser Arg Phe 

275 280 285 

Phe Phe 



<210> 9 
<211> 281 
<212> PRT 

<213> Ehrlichia chaffeensis 

<220> 

<223> amino acid sequence of E. chaffeensis P28 

<400> 9 

Met Asn Tyr Lys Lys Val Phe He Thr Ser Ala Leu He Ser Leu 

5 10 15 

He Ser Ser Leu Pro Gly Val Ser Phe Ser Asp Pro Ala Gly Ser 

20 25 30 

Gly He Asn Gly Asn Phe Tyr He Ser Gly Lys Tyr Met Pro Ser 

35 40 45 

Ala Ser His Phe Gly Val Phe Ser Ala Lys Glu Glu Arg Asn Thr 

50 55 60 

Thr Val Gly Val Phe Gly Leu Lys Gin Asn Trp Asp Gly Ser Ala 

65 70 75 

He Ser Asn Ser Ser Pro Asn Asp Val Phe Thr Val Ser Asn Tyr 

80 85 90 

Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala 

95 100 105 



SEQ9/24 



WO 00/32745 



PCT/US99/28075 



lie Gly Tyr Ser Met Asp Gly Pro Arg He Glu Leu Glu Val Ser 

110 115 120 

Tyr Glu Thr Phe Asp Val Lys Asn Gin Gly Asn Asn Tyr Lys Asn 

125 130 135 

Glu Ala His Arg Tyr Cys Ala Leu Ser His Asn Ser Ala Ala Asp 

140 145 150 

Met Ser Ser Ala Ser Asn Asn Phe Val Phe Leu Lys Asn Glu Gly 

155 160 165 

Leu Leu Asp He Ser Phe Met Leu Asn Ala Cys Tyr Asp Val Val 

170 175 180 

Gly Glu Gly He Pro Phe Ser Pro Tyr He Cys Ala Gly He Gly 

185 190 195 

Thr Asp Leu Val Ser Met Phe Glu Ala Thr Asn Pro Lys He Ser 

200 205 210 

Tyr Gin Gly Lys Leu Gly Leu Ser Tyr Ser lie Ser Pro Glu Ala 

215 220 225 

Ser Val Phe He Gly Gly His Phe His Lys Val He Gly Asn Glu 

230 235 240 

Phe Arg Asp He Pro Thr He He Pro Thr Gly Ser Thr Leu Ala 

245 250 255 

Gly Lys Gly Asn Tyr Pro Ala lie Val He Leu Asp Val Cys His 

260 265 270 

Phe Gly He Glu' Leu Gly Gly Arg Phe Ala Phe 

275 280 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 
<400> 



10 

283 

PRT 

Ehrlichia chaffeensis 

amino acid sequence of E. chaffeensis 0MP-1B 
10 



Met Asn Tyr Lys Lys He Phe Val Ser Ser Ala Leu lie Ser Leu 
5 10 15 

Met Ser lie Leu Pro Tyr Gin Ser Phe Ala Asp Pro Val Thr Ser 
20 25 30 



SEQ 10/24 



WO 00/32745 PGT/US99/28075 

Asn Asp Thr Gly lie Asn Asp Ser Arg Glu Gly Phe Tyr lie Ser 

35 40 45 

Val Lys Tyr Asn Pro Ser lie Ser His Phe Arg Lys Phe Ser Ala 

50 55 60 

Glu Glu Ala Pro lie Asn Gly Asn Thr Ser lie Thr Lys Lys Val 

65 70 75 

Phe Gly Leii Lys Lys Asp Gly Asp lie Ala Gin Ser Ala Asn Phe 

80 85 90 

Asn Arg Thr Asp Pro Ala Leu Glu Phe Gin Asn Asn Leu lie Ser 

95 100 105 

Gly Phe Ser Gly Ser lie Gly Tyr Ala Met Asp Gly Pro Arg lie 

110 115 120 

Glu Leu Glu Ala Ala Tyr Gin Lys Phe Asp Ala Lys Asn Pro Asp 

125 130 135 

Asn Asn Asp Thr Asn Ser Gly Asp Tyr Tyr Lys Tyr Phe Gly Leu 

140 145 150 

Ser Arg Glu Asp Ala lie Ala Asp Lys Lys Tyr Val Val Leu Lys 

155 160 165 

Asn Glu Gly lie Thr Phe Met Ser Leu Met Val Asn Thr Cys Tyr 

170 175 180 

Asp lie Thr Ala Glu Gly Val Pro Phe lie Pro Tyr Ala Cys Ala 

185 190 195 

Gly Val Gly Ala Asp Leu lie Asn Val Phe Lys Asp Phe Asn Leu 

200 205 210 

Lys Phe Ser Tyr Gin Gly Lys lie Gly lie Ser Tyr Pro lie Thr 

215 220 225 

Pro Glu Val Ser Ala Phe lie Gly Gly Tyr Tyr His Gly Val lie 

230 235 240 

Gly Asn Asn Phe Asn Lys lie Pro Val lie Thr Pro Val Val Leu 

245 250 255 

Glu Gly Ala Pro Gin Thr Thr Ser Ala Leu Val Thr lie Asp Thr. 

260 265 270 

Gly Tyr Phe Gly Gly Glu Val Gly Val Arg Phe Thr Phe 

275 280 

<210> 11 
<211> 280 



SEQ 1 1/24 



W ° 0002745 PCT/US99/28675 

<212> PRT 

<213> Ehrlichia chaffeensis 

<220> 

<223> amino acid sequence of E. chaffeensis 0MP-1C 

<400> 11 

Met Asn Cys Lys Lys Phe Phe He Thr Thr Ala Leu Ala Leu Pro 

5 10 15 

Met Ser Phe Leu Pro Gly He Leu Leu Ser Glu Pro Val Gin Asp 

20 25 ' 30 

Asp Ser Val Ser Gly Asn Phe Tyr He Ser Gly Lys Tyr Met Pro 

35 40 45 

Ser Ala Ser His Phe Gly Val Phe Ser Ala Lys Glu Glu Lys Asn 

50 55 60 

Pro Thr Val Ala Leu Tyr Gly Leu Lys Gin Asp Trp Asn Gly Val 

65 70 75 

Ser Ala Ser Ser His Ala Asp Ala Asp Phe Asn Asn Lys Gly Tyr 

80 85 90 

Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala 
95 100 105 

lie Gly Tyr Ser Met Gly Gly Pro Arg He Glu Phe Glu Val Ser 
HO 115 120 

Tyr Glu Thr Phe Asp Val Lys Asn Gin Gly Gly Asn Tyr Lys Asn 
125 130 135 

Asp Ala His Arg Tyr Cys Ala Leu Asp Arg Lys Ala Ser Ser Thr 
140 145 150 

Asn Ala Thr Ala Ser His Tyr Val Leu Leu Lys Asn Glu Gly Leu 
155 160 165 

Leu Asp He Ser Leu Met Leu Asn Ala Cys Tyr Asp Val Val Ser 
170 175 • 180 

Glu Gly lie Pro Phe Ser Pro Tyr He Cys Ala Gly Val Gly Thr 
185 190 195 

Asp Leu He Ser Met Phe Glu Ala He Asn Pro Lys He Ser Tyr 
200 205 210 

Gin Gly Lys Leu Gly Leu Ser Tyr Ser He Asn Pro Glu Ala Ser 
215 220 225 

Val Phe Val Gly Gly His Phe His Lys Val Ala Gly Asn Glu Phe 
230 235 ' 240 



SEQ 12/24 



WO 00/32745 



PCT/US99/28075 



Arg Asp lie Ser Thr Leu Lys Ala Phe Ala Thr Pro Ser Ser Ala 
245 250 255 

Ala Thr Pro Asp Leu Ala Thr Val Thr Leu Ser Val Cys His Phe 
260 265 270 

Gly Val Glu Leu Gly Gly Arg Phe Asn Phe 
275 280 



<210> 


12 


<211> 


286 


<212> 


PRT 


<213> 


Ehrlichia chaffeensis 


<220> 




<223> 


amino acid sequence of 


<400> 


12 



Met Asn Cys Glu Lys Phe Phe lie Thr Thr Ala Leu Thr Leu Leu 

5 10 15 

Met Ser Phe Leu Pro Gly lie Ser Leu Ser Asp Pro Val Gin Asp 

20 25 30 

Asp Asn lie Ser Gly Asn Phe Tyr lie Ser Gly Lys Tyr Met Pro 

35 40 45 

Ser Ala Ser His Phe Gly Val Phe Ser Ala Lys Glu Glu Arg Asn 

50 55 60 

Thr Thr Val Gly Val Phe Gly He Glu Gin Asp Trp Asp Arg Cys 

65 70 75 

Val He Ser Arg Thr Thr Leu Ser Asp He Phe Thr Val Pro Asn 

80 85 90 

Tyr Ser Phe Lys Tyr Glu Asn Asn Leu Phe Ser Gly Phe Ala Gly 

95 100 105 

Ala He Gly Tyr Ser Met Asp Gly Pro Arg He Glu Leu Glu Val 

110 115 120 

Ser Tyr Glu Ala Phe Asp Val Lys Asn Gin Gly Asn Asn Tyr Lys 

125 130 135 

Asn Glu Ala His Arg Tyr Tyr Ala Leu Ser His Leu Leu Gly Thr 

140 145 150 

Glu Thr Gin He Asp Gly Ala Gly Ser Ala Ser Val Phe Leu He 

155 160 165 



SEQ 13/24 



WO 00/32745 



PCT/US99/28075 



Asn Glu Gly Leu Leu Asp Lys 
170 

Asp Val lie Ser Glu Gly He 
185 

Gly lie Gly He Asp Leu Val 
200 

Lys He Ser Tyr Gin Gly Lys 
215 

Pro Glu Ala Ser Val Phe lie 
230 

Gly Asn Glu Phe Arg Asp lie 
245 

Ala Leu Ala Gly Lys Gly Asn 
260 

Val Phe Tyr Phe Gly He Glu 
275 

Leu 



Ser Phe Met Leu Asn Ala Cys Tyr 
175 . 180 

Pro Phe Ser Pro Tyr He Cys Ala 
190 195 
Ser Met Phe Glu Ala He Asn Pro 
205 210 
Leu Gly Leu Ser Tyr Pro He Ser 
220 225 
Gly Gly His Phe His Lys Val He 
235 240 
Pro Thr Met He Pro Ser Glu Ser 
250 255 
Tyr Pro Ala He Val Thr Leu Asp 
265 270 
Leu Gly Gly Arg Phe Asn Phe Gin 
280 285 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 
<400> 

Met Asn Cys 
Met Ser Phe 
Asp Asn He 
Ser Ala Ser 
Pro Thr Val 
Ser Ser Ser 



13 

278 

PRT 

Ehrlichia chaffeensis 

amino acid sequence of E, chaffeensis OMP-lE 
13 

Lys Lys Phe Phe He Thr Thr Ala Leu Val Ser Leu 

5 10 15 

Leu Pro Gly He Ser Phe Ser Asp Pro Val Gin Gly 

20 25 30 

Ser Gly Asn Phe Tyr Val Ser Gly Lys Tyr Met Pro 

35 40 45 

His Phe Gly Met Phe Ser Ala Lys Glu Glu Lys Asn 

50 55 60 

Ala Leu Tyr Gly Leu Lys Gin Asp Tip Glu Gly He 

65 70 75 

Ser His Asn Asp Asn His Phe Asn Asn Lys Gly Tyr 

80 85 90 



SEQ 14/24 



WO 00/32745 



PCT/US99/28075 



Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala 

95 100 105 

lie Gly Tyr Ser Met Gly Gly Pro Arg Val Glu Phe Glu Val Ser 

110 115 120 

Tyr Glu Thr Phe Asp Val Lys Asn Gin Gly Asn Asn Tyr Lys Asn 

125 130 135 

Asp Ala His Arg Tyr Cys Ala Leu Gly Gin Gin Asp Asn Ser Gly 

140 145 150 

lie Pro Lys Thr Ser Lys Tyr Val Leu Leu Lys Ser Glu Gly Leu 

155 160 165 

Leu Asp lie Ser Phe Met Leu Asn Ala Cys Tyr Asp lie lie Asn 

170 175 180 

Glu Ser lie Pro Leu Ser Pro Tyr lie Cys Ala Gly Val Gly Thr 

185 190 195 

Asp Leu lie Ser Met Phe Glu Ala Thr Asn Pro Lys lie Ser Tyr 

200 205 210 

Gin Gly Lys Leu Gly Leu Ser Tyr Ser lie Asn Pro Glu Ala Ser 

215 220 225 

Val Phe He Gly Gly His Phe His Lys Val He Gly Asn Glu Phe 

230 235 240 

Arg Asp He Pro Thr Leu Lys Ala Phe Val Thr Ser Ser Ala Thr 

245 250 255 

Pro Asp Leu Ala He Val Thr Leu Ser Val Cys His Phe Gly He 

260 265 270 

Glu Leu Gly Gly Arg Phe Asn Phe 

275 

<210> 14 
<211> 280 
<212> PRT 

<213> Ehrlichia chaffeensis 

<220> 

<223> amino acid sequence of E. chaffeensis OMP-1F 

<400> 14 

Met Asn Cys Lys Lys Phe Phe lie Thr Thr Thr Leu Val Ser Leu 
5 10 15 



SEQ 15/24 



WO00O274S PCT/US99V28075 

Met Ser Phe Leu Pro Gly He ser Phe Ser Asp Ala Val Gin Asn 

20 25 30 

Asp Asn Val Gly Gly Asn Phe Tyr He Ser Gly Lys Tyr Val Pro 

3 5 40 45 

Ser Val Ser His Phe Gly Val Phe Ser Ala Lys Gin Glu Arg Asn 

50 55 eo 

Thr Thr Thr Gly Val Phe Gly Leu Lys Gin Asp Trp Asp Gly Ser 

65 .70 ?5 

Thr He Ser Lys Asn Ser Pro Glu Asn Thr Phe Asn Val Pro Asn 

80 85 90 

Tyr Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly 

95 100 105 , 

Ala Val Gly Tyr Leu Met Asn Gly Pro Arg He Glu Leu Glu Met 

110 115 120 ■ 

Ser Tyr Glu Thr Phe Asp Val Lys Asn Gin Gly Asn Asn Tyr Lys 
125 130 135 

Asn Asp Ala His Lys Tyr Tyr Ala Leu Thr His Asn Ser Gly Gly 
140 145 150 

Lys Leu Ser Asn Ala Gly Asp Lys Phe Val Phe Leu Lys Asn Glu 
155 160 165 

Gly Leu Leu Asp He Ser Leu Met Leu Asn Ala Cys Tyr Asp Val 
170 175 iso 

He Ser Glu Gly He Pro Phe Ser Pro Tyr He Cys Ala Gly Val 
185 190 195 

Gly Thr Asp Leu He Ser Met Phe Glu Ala He Asn Pro Lys He 
200 205 210 

Ser Tyr Gin Gly Lys Leu Gly Leu Ser Tyr Ser He Ser Pro Glu 
215 220 225 

Ala Ser Val Phe Val Gly Gly His Phe His Lys Val He Gly Asn 
230 235 240 

Glu Phe Arg Asp lie Pro Ala Met lie Pro Ser Thr Ser Thr Leu 
245 250 255 

Thr Gly Asn His Phe Thr lie Val Thr Leu Ser Val Cys His Phe 

260 265 270 

Gly Val Glu Leu Gly Gly Arg Phe Asn Phe 
275 280 



SEQ 16/24 



WO 00/32745 




<2l0> 


15 


<211> 


284 


<212> 


PRT 


<213> 


Cowdria ruminantium 


<220> 




<223> 


amino acid sequence 


<400> 


15 



PCT/US99/28075 



C. ruminantium MAP-1 

Met Asn Cys Lys Lys He Phe lie Thr Ser Thr Leu He Ser Leu 

5 10 15 

Val Ser Phe Leu Pro Gly Val Ser Phe Ser Asp Val He Gin Glu 

20 25 30 

Glu Asn Asn Pro Val Gly Ser Val Tyr He Ser Ala Lys Tyr Met 

35 40 45 

Pro Thr Ala Ser His Phe Gly Lys Met Ser He Lys Glu Asp Ser 

50 55 go 

Arg Asp Thr Lys Ala Val Phe Gly Leu Lys Lys Asp Trp Asp Gly 

65 70 75 

Val Lys Thr Pro Ser Gly Asn Thr Asn Ser He Phe Thr Glu Lys 

80 85 go 

Asp Tyr Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala 
95 100 105 

Gly Ala Val Gly Tyr Ser Met Asn Gly Pro Arg He Glu Phe Glu 
110 115 120 

Val Ser Tyr Glu Thr Phe Asp Val Arg Asn Pro Gly Gly Asn Tyr 
125 130 135 

Lys Asn Asp Ala His Met Tyr Cys Ala Leu Asp Thr Ala Ser Ser 
140 145 150 

Ser Thr Ala Gly Ala. Thr Thr Ser Val Met Val Lys Asn Glu Asn 
155 160 165 

Leu Thr Asp He Ser Leu Met Leu Asn Ala Cys Tyr Asp He Met 
170 175 180 

Leu Asp Gly Met Pro Val Ser Pro Tyr Val Cys Ala Gly He Gly 
185 190 195 

Thr Asp Leu Val Ser Val lie Asn Ala Thr Asn Pro Lys Leu Ser 
200 205 210 

Tyr Gin Gly Lys Leu Gly He Ser Tyr Ser He Asn Pro Glu Ala 
215 220 225 



SEQ 17/24 



WO 00/32745 PCT/US99/28075 

Ser lie Phe lie Gly Gly His Phe His Arg Val lie Gly Asn Glu 

230 235 240 

Phe Lys Asp lie Ala Thr Ser Lys Val Phe Thr Ser Ser Gly Asn 

245 250 255 

Ala Ser Ser Ala Val Ser Pro Gly Phe Ala Ser Ala lie Leu Asp 

260 265 270 

Val Cys His Phe Gly lie Glu He Gly Gly Arg Phe Val Phe 

275 280 



<210> 16 

<211> 20 

<212> DNA 

<213> artificial sequence 
<220> 

<221> primer_bind 

<222> nucleotides 313-332 of C. ruminantium MAP-1, 

also nucleotides 307-326 of E. chaff eensis P28 

<223> forward primer 793 for PCR 

<400> 16 



gcaggagctg ttggttactc 



20 



<210> 17 

<211> 21 

<212> DNA 

<213> artificial sequence 
<220> 

<221> primer_bind 

<222> nucleotides 823-843 of C. ruminant ivm MAP-1, 

also nucleotides 814-834 of E. chaffeensis P28 

<223> reverse primer 1330 for PCR 

<400> 17 



ccttcctcca agttctatgc c 



21 



<210> 18 
<211> 24 
<212> DNA 



SEQ 18/24 



WO 00/32745 



PCT/US99/28075 



<213> 
<220> 
<221> 
<223> 
<400> 



artificial sequence 
primer_bind 

primer 46f , specific for ECa28SA2 gene 
18 



atatacttcc tacctaatgt ctca 



24 



<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 

<400> 



19 
20 
DNA 

artificial sequence 
primer_bind 

primer used for sequencing 28-kDa protein 

in E. canis 

19 



genes 



agtgcagagt cttcggtttc 



20 



<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 

<400> 



20 
18 
DNA 

artificial sequence 
primer_bind 

primer used for sequencing 28-kDa protein genes 

in E. canis 

20 



gttacttgcg gaggacat 



18 



<210> 21 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<221> primer_band 

<222> nucleotides 687-710 of ECa28-l 



SEQ 19/24 



WO 00/32745 



PCT/US99/28075 



<223> printer 394 for PCR 

<400> 21 



gcatttccac aggatcatag gtaa 



24 



<210> 


22 


<211> 


24 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<221> 


primer_band 


<222> 


nucleotides 710-687 


<223> 


primer 394C for PCR 


<4D0> 


22 



ttacctatga tcctgtggaa atgc 



24 



<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 



<400> 

gagtaaccaa cagctcctgc 



23 . 

20 

DNA 

artificial sequence 
primerjbind 

primer 793C which anneals to a region with 
Eca28-1, used to amplify the intergenic 
region between gene ECa28SA3 and ECa28-l 
23 



20 



<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<222> 
<223> 



24 
24 
DNA 

artificial sequence 
primer_band 

primer EC280M-F complementary to noncoding 
regions adjacent to the open reading frame 



SEQ 20/24 



WO00O2745 



PCT/US99/28075 



<400> 



of ECa2$-l 
24 



tctactttgc acttccacta ttgt 



24 



<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 



<400> 



25 
24 
DNA 

artificial sequence 
primer_band 

primer EC280M-R complementary to noncoding 
regions adjacent to the open reading frame 
of ECa28-l 
25 



attcttttgc cactattttt cttt 



24 



<210> 26 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<221> primer_bind 

<223> primer ECaSA3-2 corresponding to regions within 

ECa28SA3, used to amplify the intergenic region 
NC3 between gene ECa28SA3 and ECa28-l 
<400> 26 



ctaggattag gttatagtat aagtt 



25 



<210> 
<211> 
<212> 

<213> 
<220> 
<221> 
<223> 



27 
23 
PRT 

Ehrlichia canis 
PEPTIDE 

a predicted N-terminal signal peptide of ECa28-l 



SEQ 21/24 



WO00A32745 



PCT/US99/28075 



<400> 



arid ECa28SA3 
27 



Met Asn Cys Lys Lys He Leu He Thr Thr Ala Leu Met Ser Leu 
5 .10 15 

Met Tyr Tyr Ala Pro Ser He Ser 
20 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 

<400> 



28 
25 
PRT 

Ehrlichia chaffeensis 

amino acid sequence of N- terminal signal peptide 

of E. chaffeensis P28 

28 



Met Asn Tyr Lys Lys lie Leu He Thr Ser Ala Leu He Ser Leu 

5 10 15 

He. Ser Ser Leu Pro Gly Val Ser Phe Ser 

20 25 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 

<400> 



29 
26 
PRT 

Ehrlichia canis 

amino acid sequence of putative cleavage site of 

ECa28-l 

29 



Met Asn Cys Lys Lys He Leu He Thr Thr Ala Leu He Ser Leu 
5 10 15 

Met Tyr Ser He Pro Ser He Ser Ser Phe Ser 
20 25 



<210> 
<211> 



30 
299 



SEQ 22/24 



WO 00/32745 PCT/US99/28075 

<212> DNA 

<213> Ehrlichia canis 

<220> 

<223> nucleic acid sequence of intergenic noncoding 

region 1 (28NC1) 
<400> 30 

taatacttct attgtacatg ttaaaaatag tactagtttg cttctgtggt 50 
ttataaacgc aagagagaaa tagttagtaa taaattagaa agttaaatat 100 
tagaaaagtc atatgttttt cattgtcatt gatactcaac taaaagtagt 150 
ataaatgtta cttattaata attttacgta gtatattaaa tttcccttac 200 
aaaagccact agtattttat actaaaagct atactttggc ttgtatttaa 250 
tttgtatttt tactactgtt aatttacttt cactgtttct ggtgtaaat 299 

<210> 31 

<211> 345 

<212> DNA 

<213> Ehrlichia canis 

<220> 

<223> nucleic acid sequence of intergenic noncoding 

region 2 (28NC2) 
<400> 31 

taatttcgtg gtacacatat cacgaagcta aaattgtttt tttatctctg 50 
ctgtatacaa gagaaaaaat agtagtgaaa attacctaac aatatgacag 100 
tacaagttta ccaagcttat tctcacaaaa cttcttgtgt cttttatctc 150 
tttacaatga aatgtacact tagcttcact actgtagagt gtgtttatca 200 
atgctttgtt tattaatact ctacataata tgttaaattt ttcttacaaa 250 
actcactagt aatttatact agaatatata ttctgacttg tatttgcttt 300 
atacttccac tattgttaat ttattttcac tattttaggt gtaat 345 

<210> 32 

<211> 345 

<212> DNA 

<213> Ehrlichia canis 

<220> 

<223> nucleic acid sequence of intergenic noncoding 

region 3 (28NC3) 

<400> 32 



SEQ 23/24 



WO00O2745 PCI7US99/28075 

tc,_ttttatt gttgccaeat attaaaaatg atctaaactt gtttttatta 50 
ttgctacata caaaaaaaag aaaaatagtg gcaaaagaat gtagcaataa 100 
gagggggggg ggggactaaa tttaccttct attcttctaa catcctttac 150 
tatattcaaa tagcacaacc caatgcttcc aggaaaatat gctcctaata 200 
ttttatttat taccaatcct tatataatat attaaatttc ccttacaaaa 250 
atctctaatg ttttatactt aatatatata ttctggcttg tatttacttt 300 
gcacttccac tattgttaat ttattttcac tattttaggt gtaat 345 

<210> 33 

<211> 355 

<212> DNA 

<213> Ehrlichia canis 

<220> 

<223> nucleic acid sequence of intergenic noncodihg - 

region 4 (28NC4) 
<400> 33 

taattttatt gttgccaeat attaaaaatg atctaaactt gtttttawta 50 
ttgetacata caaaaaaaga aaaatagtgg caaaagaatg tagcaataag 100 
aggggggggg gggaccaaat ttatcttcta tgcttcccaa gttttttcyc 150 
gctatttatg acttaaacaa cagaaggtaa tatcctcacg gaaaacttat 200 
cttcaaatat tttatttatt accaatctta tataatatat taaatttctc 250 
ttacaaaaat cactagtatt ttataccaaa atatatattc tgacttgett 300 
ttcttctgea cttctactat ttttaattta tttgtcacta ttaggttata 350 
ataaw 355 



SEQ 24/24 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
Q^FADED TEXT OR DRAWING 
EfBLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

ETcOLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

EfllNES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



